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ABSTRACT: 

PURPOSE:To obtain a gene of a newly elucidated hepatitis C virus (HCV), a specific 
oligonucleotide, a method for detection thereof and a method for determining the genotype of the 
HCV. 

C0NST!TUT10N:The polynucleotide has a base sequence described in sequence Nos. 1 to 5. 
The oligonucleotide is described in sequence. No 6. The methods for detecting an HCV gene and 
determining the genotype use the polynucleotide and oligonucleotide as a primer, a probe, etc. 
Furthermore, the polypeptide is described in sequence Nos. 15 to 19. Thereby, the HCV gene of 
the nevMy found genotype can be detected and the HCV genotype Can simultaneously be 
determined over a wide range. 



{19)Q*S1$3lF;r (JP) 



(12) ^ H 4# 1^ ^ ^ (A) 



iNFii¥6-319563 



(SDlnta' 
C 1 2 N 15/51 
C0 7K 7/08 
13/00 

C 1 2 P 21/02 



ZNA 



8318-4H 
8318-4H 
8214-4B 
9050-4B 



F I 



C 1 2 N 15/00 



m^m(osc22 mm (± 4i h) smM\zm< 



(mmma 



^H¥5-147133 
¥«5¥(1993) 5^I3B 



(7I)aiHA 391039391 

*B^ST«5aaP51Silp5«1560-25 
«S:iRfeMKi?S4 TB28Sl4-70m 



(57) 

[B«i SrSic»?g$nfccieffFjfe'>'f;i';^ chcv) 

ffl-rsitt)»C. j£V>«HIC|^t>THCVjae-7-l!?£=PJ 



J I 



c 


E 






163 


IGC 


ffiS 






e 


e 


K 


i 


k ■ 



h d f h J I B 

a OitHeO) ; bCotS-BO ; c(nt]3Me06) : d(iTt13]0-iaS7) : a CntlTga-SGO) 
1 OrtSOafr^} ; k (ptm29-'X]66) ; l 0Tt7D23-7833) ; m (ptmt-aBXi) 



1 

NA • HC-G9, 

• 5' . 

«««;iieSE?lJ*^-r53t«U5{i'U:t?FYS 1 1 7 

• 3 ' . 10 

Cif^JB4] fi5!is^4E«<o:esffi5iJ. Sfctt;:nt 

ffi«W;5:SSe5>J!£^-rS3lf J ? ^ H S R 0 3 7 

• 5' . 

«««&«SE5lJS:#f 53i? U 5? i' H S R 0 3 7 

• 3' . 

?i'U':t5^H#3 2 1. 
[9*3® 9 ] E J^IS^ 6 ^<om.WSM S^r-r 5 :t 'J =f 

imim. 1 0 ] E^j#^ 6 mLOimMm^^^t^^ u 

£U:*^ y^-r ■7-'>;7. 

m^m 1 1 ] E^ijs^ 6 fm<r>m.wsM^^t^ 30 

:J ? ^ U:t5^ H:/7-fT-SifcH:t 'J rf 5C H y 

m. 

[af«ai 3] iil*]Sm7 3®;5:ViLmi 03gig3«©:tU 

14] 'J ;l 7-1^5^X1- > 'J 7^ i/3 >»IC 
«iE«OCSiffF3Si"?'r;VXja<5-Tl!!pJSffi*fc«a:«3B 

<i<i$ff^w*«si 3JSgB«©csiFf3ft'>'f;uxae 
^tttfissfctt^ 1 4aga«©cs!ffF3ft^;'r;i/X5tf5T 

(iJ*®l 6] il^am6®&tiLM9aS®(Dgilil>' 50 



!|$M¥6-3 1 9 5 6 3 

2 

a-zf. 

im^mi 81 &m»m 5mm(D^u^-:r^^ khc 

-G9Peotein. ^ii=>Zf\Z^(0{$^'^-^i''f F. 

1 9 ] E^J#^ 1 6 13«©i}« >J '^y^' H Y S 
117-5' Peptide. &6lXC-?-<Dfi5»'^:/^' 

[il«312 01 EJOS^l 7|BtEOAS>J'<:/3"f KYS 
117-3' Peptide. ift^mc^CDSg^i''^:/^ 
-f H. 

[a^JB2 1] E^JS^l SWm<0:^i}^-:f^-i HSR 
037-5' Peptide. 7S:6Wc:^©g5»'^7'^' 

C»««2 2] EJ^JS^l 9iae®3j<'J'^y5''f HSR 
037-3' Peptide, &^t;»C-e®a5iJ"<y^' 

CO 0 0 1] 

T fHCVj tiSIB-r-S) rocDNASe^. ^:n?£«l 
C00021 

C?£3(E©ft'fg] 19 8 S^tCHC Vlie^©-g5*i«p?g 

^ti^s^nxfiofe. Hcv^»f'vffl>£;fflpiii7asc^< 
®K^SdiBi%5n^iBfl:$nT#it. i:n*-e. hcv 

matkizi:^ Tai«ifiL?8t4> lcS*3tx-5 H C VtrifrSttHl-r 

tttfl-rs::^SAt|i553ni2:<ffli»e.nTS;t)0<, set 
izm-)-^mm.^<DmM^^\t, m^-'^mt^mtK^m 

G^EJ*>J«-5l'iH1$SffiIR®<#fei:^-tllcS^< ^»rfe 
:3ViT(ijteT-©±E?iW^»W$nT*3 0. * e Jcffi© 

ttcict?v»TttaG^E5ijo-«5*«»?g$n. Hcvi^a 

jte^ ©E?iI©!|$:g*-5 t»ttH C V1#S7 5 y KEJIJ© 
^^IC^IJffl^nfc, ■?-©ite:S. fie3l5©«Scj£lCit'<. c 

ne©tSffiicScJv^TiB%$nfcSja©^9f>£tSigs^^# 

mf3im^:^u^um-vt^jzo\zrti)}t\:tf)rz. la^ 
L. ffi;^. j:ne©«^3^ffi!£fflt»fc«'&-ct). i^afeMts 

[0 0 0 3] 
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3 

*McDff S L T v> ^ <0;^i S)Oi 6 75: l^^o 
[0 0 041 

(D^gH C V<^«iatTFi+iMCiI$S^#tt ^n^t^^H C V 

\z^mu'&&^m\\^^^'r^Lth\z. z.<r>mm-^^ 
r S:^- U rf ^ ^57 5" H 1 c ^tci$sw;^S3?rj^&irr 

•r s c i (c J: 0 T-a(D«iitT®eTScDfij^75«^^T 

^7 V-t^ \^^XXS^^) Yii^^fs.^:/^ 

^T(Z><7>gS;51* :3 T t):^g<£r§tt/^ C L \tm^(0 Z. 
^ ^ Y\Zl^\.Xl^OWW^^m^^ifs\) 5^ ^ U:t 

9- Yit^xs\z:^')ziy. ^ }^hmm^mm(Dmm\z 
[0 0 0 51 rr3it>^. *%mit<^m<oHCY t}tm:f3: 



> #ffl¥6-3 19 56 3 

4 

4ISS^J<£^r-r^2SU^^ ^ KS;?S:fJ:r U z[y^!7U 

CO 0 0 61 iiffi0»iiic^t-j;^tc. se 

KHCV^£HC-G9i^«U ^0 CO 2«i#(C-::?t.^T 
fl^<D-gP(OEJ?iJ<£4$5£U, ^«i<*:SS 1 1 7. SRO , 
3 7 t A« L;to ^(O^SSB^JtiE^J#^ 1 L 5 IE 

m\zM.^nrzT7. huy^'fs^^^m'^r 944 om<DM 

)Sl^T9 0 3 Z^m^ibU^ 3 0 1 17^/K$:3 

[0 0 0 7] ^i&m2\zmrj:v\z:$i^m 

Mtc^-r^ 1 4^co«Sffi??iJ.h^li:aELfc, ^<Di^^. 

±m^(Dm&mmt^m0»{ik'Tmmmt<D?^\z\t2 0 

^1^J?9 L T V> ^ H C V <h h^im-CSb ^niL LTzi&bkTm 

tLx^^m-^n^ct^n.thbfzo ^§im^^\t:^^m 
\zib^i!)^^mm0Hcv(om^^m^^mmz\ cmt^ 

^Lfz. 

[0 0 0 8] i^^mm^\t. ^mm2\z7nr^o\zic 
mncN^(DmB^^mM^\zmMt^'&B^miAt 
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(4) 

5 

[0 0 0 9] HCVoag^tttBfeiC^ 
iB«©^a<D:t-U nfj? KCffiJUa 1 c SJCfe J: 

<«??$nT:feD. j:n*JHcv©sfe?«m*«i:y5i 
CO 0 1 0] i^mm-v\m^<r)mm-)5mt.\^xif^^) :^ 

7— tf. yx-fX 'J7i'->3>ffi (PCRS) ^StS 

coo 1 1] pcRfetcftvsTtt. m-WL'^thx^m 

\Z^m-r^»mi3.t 'J :^ W:t5^ F'^T- 1 l/Ttt, 

Lxm^^^ 6 u ^ v-t^ y- tm\m^ 9 

Co 0 1 2] Sfc. *%M®^ij£icJ;ntf*5!?8©SrS J» 
LiB^#^ 1 4 ® §KffliaeTl!#SW:t U =f 3? W 

-fv-tsiBi^cffifflu pcRji*-5ii«4ia0 2a 
CO 0 1 3] *%w^^ttii»©«itca*2.^ 

CO 0 1 4] *%?gcDJKU'^y3''f FlC*ttS75y» 40 

:/ (E 1) i5J:i;E2/Ns iTtt-e<offiBi±*«<£<. 

i!1^JIW-CJ65e:t&^bTV>-5. El. E2/NS1 

•5. Ltzifl-oX, ^j^^gojj^'j^y^-f H*3±r;-e®3? 
i>:tUrf'^yi5''1' Hai cS!i|^SW(cln;x>'^D-:r$i[ 

*fcNS2~NS5MJg{ti7'nT-7— 5» 
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6 

;fS:Si«^«iliJ^-?'HC VlijBfflSa!I©IJB58JcfflViS d i:*t 
CO 0 1 5] 

CO 0 1 6] 

{^mm\ BIT, *5l9J©IISS«^tc-:3tiT»]^'-i-5*t. 
t> <i: J: 0 c:ne.©^«1Cag^$n-5 t)©T ti 

[00 17] 

fi£*©afi-?StC»IB$*TJ&:Viffil!c©SrffiH C Vt*5£^ 
au. ^©^:SSffi$iJ*5J;tX-g5©iaSBB5iJ<&*©J;3 

[0 0 18] (1) RNA©ttaj 
*BR® H C V*i^**i2E3l*<fcJ/*55??« ^ ic J: o Tffl^ 
$ni|#lfUS4'©6i«^ttlEffi (1$e¥2 - 1 5 3 4 0 

(i|#ra¥5-2 3 2 0 0) iCiOHCV^SSiCitKKSn 

vjae^iJ'Pi^icBa-rs;??^ (#H¥3 - 3 o 729 

6, 4-09396 0) -t^yX^O^m.^^^^^ifiX^IS. 
1)^-:>TzWA&^^^<nia.mm^ (HC-G9, YS 1 1 

7, :fcJ;CXSR0 3 7) A>6;>:©cfc5lCLTRNASffl 
WLfc. ifilfSS Om I CiSaa© h U XSfffR (1 Om 
M, pH8. 0) SaoA. 9 0 X 1 0 3 r pmtCTl 5 

^m<D^-L-iim^n^Tz, #e.nfc'^U'!yKc2oom 

M©NaCI. 1 OmMEDTA. 2% (ffifi/^ffiO 
©Kx->;HSSe:J-F'Jr?A (SDS) <h1mg/ml© 
:/Px:^— tfKS^trh'JxeasffS (5 OmM, pH 

8, 0) siin^. 6 O'er i^mjnjau. 7x/-;i'/' 

VsRNASIIfc. 
[0 0 19] (2) cDNA©f^S 

&^ft«^J:o^5fcRNAS7o*CTl^^l^ll^ollL.m ci 

n*il?&L»5!RNAtLfc. C©»S!RN A-lJ->:/Jl' 
(C 1 0 OO-X-y h©jS»1£^ffll* (Superscrlp 
t;GIBCO. BRL) 43<i;U::t U UtT? 

^'fv-2 0 pmo 1 sfli^. 4 21:;. mimsiEJ^E-a- 

TcDNAS:^fc. 

[0 0 2 0] (3) cDNA©#<J^7— lf9^x-f>U 
7^'->3> (PGR) c:i-3i§<i 

±ga©«fiiicj:o#6nfcm^cDNAic-s)tir. si 




(5) 
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C^-rffi«9J(C. ^ 1 \Z7F:T&mmmzWf^Lrcir.>X *&[ScIence. Vol. 239. p487-49 



E3?5*#: 1 
WM<r>^^ : 94S7 

«<?)a : 

il5(I<^>SS : c DNA to genomic RNA (HC-G9 COHA) 

6CCAGCCCCC TGATGeGGGC GACACTCCGC CATGAATCAC TCCCCTGTfiA G6AACTACTG 60 
TCTTCACGCA 6AAA6CGTCT AGCCATGGCG TTAGTATGAG T6TC6T6CAG CCTCCAGGAC 120 
CCCCCCTCCC GGGAGAfiCCA TAGTGGTCTG C6GAACCGGT GAGTACACCG 6AATT6CCAG 180 
GACGACCGGG TCCTHCTTG GATTAACCC6 CTCAATGCCT GGAGAITTGG GC6TGCCCCC 240 
6CAAGACTGC TAGCC6AGTA GTGTTGG6TC GCGAAAGGCC TTG7G6TACT GCCTGATAGG 500 
GT6CTT6CGA GTGCCCC6GG AGGTCTCGTA GACCSTGCAC CATGA6CAC6 AATCCTAAAC 360 
CTCAftAGAAA AACCAAACGT AACACCAACC GCCGCCCACA GGACGTTAAG HCCCGGGIG 420 
GCGGCCAGAT CGTTGGCG6A GTTTACTTGT T6CCGC6CA6 6G6CCCCAGA 6TG6GTGTGC 430 
GC6CGAC6AG 6AAGACTTCC GAGCGGTCGC AACCTCGCGG GAGGC6TCAG CCTATTCCCA 540 
AGGCCCGCCG ACCC6AG6GA AGGTCCTGG6 CGCA6CCCGG GTACCCTrOG CCCCTCTAT6 600 
GCAACGAGG6 CTGTGGGTGG GC6GGATGGC FCCTGTCCCC CCGCGGCTCT C6GCCTAGTT 660 
GG6GCCCTTC TGACCCCCGG CGGAGGTCAC GCAATTrGGG TAAGGTCATC GATACCCTCA 720 
CGT6T6GCTT CGCCGACCTC ATGG6GTACA TCCC6CTCGT C66CGCTCCT CTAGGGGGC6 730 
CTGCCAGAGC TCTG6CACAT 6GTGTTAGAG TCCTG6AAGA CGGCGTGAAT TAC6CAACAG WO 
6GAACCTCCC CGGirGCTCT TTTTCTATCT TCTTGCTCGC TCTTCTATCC TGCCTGACA6 900 
TCCCTGCTTC GGCCGTC66A GTGCGCAACT CTTCG6GGGT GTACCATGTC ACCAATGATT 960 
GCCCCAAIGC GTCC6TTGTG TACGAGACGG AGAACCT6AT CATGCATCTG CCCGGGTGTG 1020 
IGCCCTACGT ACGCOAGGGC AACGCCTCGA G6TGTTCGGT CTCCCTTAGr CCCACCGTAG 1080 
CC6CCAGG6A TTCGCGCGTC CCCGTCAGTG AGGfTCGGCG TCGTGTC6AC TCGATT6TCG 1140 
GGGCCGCTGC GTTCTGTTCG GCTAT6TATG TAGGGGACCT ArCCGGCTCC ATCrTCCITG 1200 
TTGGCCAGAT CTTCACCTTC TCTCCCAG6C ACCATTGGAC GACGCAAGAC rGCAATFGCT 1260 
CCATCTACCC AGGCCATGT6 ACAGGTCATC GAATGGCTTG G6ACATGATG ATGAATTGGT 1320 

[0 0 2 3] (4) cDNA7'f:/7'J-ro«I^ICj;-2) 'J j^i^U^t-^HA-f^-X (New England 
HC-G9, YS 1 1 7, :feJ:t;SR0 3 TWSSS^lJ Biolabs),T4 DN Aif'J j^C^—- 1 (T a k 



97— (Perkin-Elmer. Cetus)tG 
ene Amp DNAiiit@aii=*^s/ h (P e r k 1 n 
-Elmer-Cetus) Sffll-^TS a i k 1 



1 (1 9 8 8) ] iz^-Dr3 5m<Dmm'^^i^j\^ifi6y3: 

[002 1] 

mi] 

[0022] 



ara Biochemicals) "CfflSft. M 1 3 
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Si5£(id i de oxy chain terminat 
ionffilCT, S equencenase sequ 
encekit ver 2, 0 (United Sr 
are B i o c h emn i c a 1 s) ^S^^ttAu t 
oRead Sequencing kit (Pha 
rmac i a) ^m^'^Xfrr^lt. ^m^\Z^\^^X. 

»fflUTE^^»^Lfco HC-G9lc:3vmi±®« 
YS 1 1 7;ia:5WcSR0 3 TtC-^V^TttS' igJ: 
0 6 3#SCDffiS30>e 1 8 6 7#a(DtaS^r<OE5(l^ 
8 2 5 9#B?5^S9 4 4 OSB^rcDE^^HC-^i^iTSt^ 
hit. fi5q^llCHC-G9, S?^S2{c:YS 1 1 7, 
©5' ffil. ffiJ?r[^3tCYS 117. ©3' ffl. E3^m4 
tCSR037cr)5'fil. ffi?fli^5fCSR0 3 7.(DmMt 
m(OZ' fflJcD&iSSE^^&^-ro ^2lCHC-G9. Y 
S 1 1 7, S R 0 3 7a)ffiMraoffll^tt^^To -e<Oite 



(6) 1$S¥6-3 1 9 5 6 3 

2 icHc - G 9 cns7?(r^«oiess?rw^»?gsn 

H c - G 9 fiii-rncoKaofie? S«c©i6S 

it. C:(DC^75>SHC-G9$^ty3tt*|c43K(DHCV 
cne«l a;S:6l^tCl btCJSi^nTViS 

ft. 

[0 0 24] 
[0 0 2 5] 

m2] 
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CACCCACTGG CGCCTTAGT6 CTGGCACAOC TACTCCGGAT CCCACAA6CT ATCGT6GATA U80 
rOATAGCTGG TGCCCACTGG GGT6TCCTA6 CG6GCCTGGC ATACTACTCC AT6GTG6GGA 1440 
ACTGGGCTAA GGTTGT6GTC GTGCTGCTGC TCnCGCIGG C6TT6AC6CA GAGACCCGfiG 1500 
TCACAGGGfiG GGCCGCrGGC CACACC6CGT TCGGGTTTGC TAGCTTCCTC GCCCCAfiGCG 1560 
CTAAGCAAAA GATCCAGCTC ATAAATACCA AC6GCA6CTQ GCACATCAAC AGAACTGCCT 1620 
T6AACTGTAA TGAAAGCITG GATACT6GCT GGCTAGCAGG GCT6CTCTAC TACCACAAGT 16S0 
TCAACTCCTC A6GGTGTCCC GAGAG6ATGG CIACTTGCCA ACCTCnACC GCCTTCfiACC 1740 
AAGGGT6GGG ACCCATCACT CAC6AGGGGA ATGCTAGTGA TGACCAGCGfi CCATATTGn UOO 
G6CACTAT6C CCTAC6CCCG 7GT6GCATTG TGCCAGCGAA AAAGGniGC GGGCCTGTAT U60 
ACTGTTTCAC ACCCAGCCCC GT66TA6TGG G6ACGACGGA CAGAGCCCfiC GTTCCTACCT 1920 
ACAGA76GG6 TGCCAATGAG ACGSATGTAC TGCTCCTCAA CAACTCTAQ6 CCGCCAATQG 1980 
GGAAHGOTT TGGGTGTACG TGGAT6AATT CTAGTC6CTT CACCAAGACG TGCG66GCTC 2040 
C6GCCTGCAA CATCGGCfiGG AGC66QAACA ATACCCTGCT GT6CCCAACA GATTGCTTCC 2100 
GTAAACATCC GGATfiCCACA TACAGCAGGT GCGGCTCTGG TCCCTGGCTT ACCCCTCGAT 2160 
CCTTGGTA6A CTACCCATAC A6GCTCTGGC ACTACCCCTG TACA6TCAAT TACACCATTT 2220 
TCAAGATCAG MTGnTGTG GGCGfiGGTTG AGCACAGGCT TGACGCCGCG FGCAACTCGfl 2280 
CGCfiGGGAGA GCGCTGCSAT TTGGACQACA GGGATCGGGC CGA6TTQAGC CCTCT6TTGC 2340 
TGTCCACTAC GCAAT6GCAG GTCCTCCCCT 6CTCATTCAC AACACTGCCC GCCCTGTCAA 2400 
CTfiGCCTGAT ACATOTCCAC CAGAACATCG TfiGACGTGCA GTACCTCTAT GG6TTGAGCT 2460 
CGGCAGTCAC ATCCTG66TC ATAAAGTGGG AfiTACGTTGT GCTCCTCnc TTGCTGCTAG 2520 
CAfiAFGCTCG CAmCTCCC TGCTT6TQGA TGATGCTTCT CATATCTCAG aTAGAflflCQfl 2580 
CGCT6GAGAA CTTGATfiGTT CTCAACGCTG CTTCCCTAGT CGG6ACACAT GGCAICGICC 2640 
CCTTCTTCAT CTnniTGT GCAGCTTQGT ACCFAAAAGG CAAGTGGGCC CCTGGAC7CG 2700 
CCTATTCCGT CTATQG6ATG IGGCCACTGC TCCFGCTTCT CC7GGCGTTG CCCCAACGGG 2760 
CA7ACGCC77 6GA7CAGCAG 77GGCC6CG7 CG7G7GGG6C CACCGTC7TC A7CTGCC7AG 2820 
C6G7GC7CAC 7CTA7C6CCA 7A7TACAAAC A67ACA7GGC CCGCGGCA7C 7GGrGCC7GC 2880 
AG7ACA7GCT 6ACCAGAGCA GAGGC6C7CC 7ACAGG777G GG7CCCCCCG C7CAACGCCC 2940 
GACGAG6GCG C6ACGQAGTC G7AC7GCICA CG7G7G7GCT CCACCC6CAC TTGC7CnTG 3000 
AAA7CACCAA GA7CA7GC7G GCCA77C7C6 G6CC777G7G GA7C7TGCAG GCCAG7C7GC 3060 
TCAAGG7ACC 67AC77C67G CGTGCCCAC6 G7C7CA77AQ GCTC76CA7G C7Gfi7GCGCA 3120 
AGACAGCCGC CCG7CAG7AT GTGCAGAFGG C7C7677AAA fiC7GGGAGCA T77GCCQGCA 3180 
[0 0 2 6] Ilig«<f2 40 S«:ftjteT»rM-®i91@lc®U:tffi?iJ*^T-5:^-'J:l5i 

(1) 1 cl!HCV®3ae?S!«^ICiaU^£:7'7'fT- aP^ffljSfii^igitiC^effl-rSit'J rf?^? u^f^Hy^i- 
was? T-i:UTfi5l]S7:feJ;0«?lJS8iC32iRO/t# 1 8 6 

icstf^. 1 cSHcvic»TSfe«as3sw<j:<«i¥ 1 0 4 t*«?^co#3 2 1 {m\m) -fy^-^-^m 

$nT*01.t?|E#®ae^i!©HCV«ci«ffl|^tt*l #186. # 2 5 6;fte.mc# I 0 405sf'J zT 

Lrz^:^i&i^\m<m^mtwimz:i rm^h^z. -^nx^ n mmw- 3-307296. 4#h¥4 - 0 9 
tmmhfc. z.n\zm-:i^. ^rmmj^r) \ cm\z<^ so 3 9 e o) ^acojae^MfijjtffiTfiEfflsnTt^s^'j 
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13 

[0 0 2 7] (2) *5gM<^# 3 2 1 ^^LfcHCV 

#ae^S!(OHCV«c (HC-J 1 : lag, HC-J 
4: I IS. HC-J6: I I IM. HC-J8: IV 
M) fXiE>lf\ZHC-G9^M<DRUAtf^iE>. #18 6(0 

<£:#2 5 6;^i^WC#l 8 6 <0:/^-1"T-^£:«Efflb:feP 
CR^&fiJfflLTl|-gPgCDii®^fTt3:/bo M-aPgcOP 
CRti, 9 4X:\ZX^^^\^. 5 SriCjl^y^-f^ 

-^^SiSi^J^ao^. 7 2t:{cj;^y^-fv-#g^ 

l£;2^r>^l-+f-f ^'^V^LS 5lJ'^^;PfTt5?to ^-gpg 

E^J#^1 1-1 4|SS<0#2 9 6> #13 3. #13 
4. # 1 3 5:fe^t;*^ig<0#3 2 ly^-f T-, Tie 

^ 9 o #io47'^-rT-$ffli.^)tPCR \zxmm 

a$mz:SPgjii|i(Dif>:/;i/i:LTPCR^fTt>fco S 
fS^^mtd 4V\Z^^^^KfS^3 0^. 6 0t:(C<i:^y 
^-f T-iK^S>S3 0|J>> 7 2'C<l:D;i^y^-rv-# 
gS;S3 0g>T, SKJS;;&>e;i^ l1J-^^;U^&^lh3 0 



(8) »ffl¥6-3 1 9 5 6 3 

[5i»ojgL?5:« m^mmj^. mmmm^i. 5%<o 

NuSieve^eWcl. 5%(7>SeaKem (FM 
C Bioproducts, U. S. A) ^m^^fcT 

m.^^t>^4 9. 1 4 4. 1 7 4. 1 2 3. 2 0 0b 

p (IS. I m. 1 1 IS. I vs. icS) ofiftt 
fz. ^Tc. ^^i^om^^mmzit^m^n^^f^m 
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CCTACAITTA CAACCACCTT TCCCCGCFCC AAGACTGG6C TCACAGCGGT CTGCGCGACC 3240 
TGGCCGTAGC CACTGAACCC 6TCATCTTCT CCCGGATGOA AATCAAGACT ATCACCTG6G 3300 
GGGCGGATAC TGC6GCTT6T G6A6ACATCA TCAACGGGCT GCCTGTTTCC GCCCGGAGAG 3380 
6GACAGA6GT GTTGCrGGGA CCAQCCGATG CCCT6ACTGA CAAAGCATGG AG6CTTTTAG 3420 
CCCCCATCAC GGCTTAC6CC CA6CAGACAC GG66TCTCTT SGGCTGCATC ATCACCAGCC 34B0 
TCACC6GTCG G6ACAAAAAT CAAGTGGAGG G6GAA6TCCA QAITQTGTCT ACC6CAACCC 3540 
A6AC6T7CTT GGCTACnGT GTTAATGQAG TTTGCTGGAC T6TGTATCAT Q6G6CCGGAT 3600 
C6AGGACCAT CGCTTCGfiCQ TCG6GCCCT6 TGATCCAGAT GTACACTAAT GTGGACCAGG 3660 
ATTTG6TGG6 CTGGCCAGCG CCTCAGGGAG CGCGCTCCCT 6AC6CCG7GC ACAT6C6GCG 3720 
CCTCGGATCT GTACTTGGTC ACGAGGCACG CGGACGTCAT TCCA6TGCG6 CGTCGG66CG 3780 
ATAACAGGGG AA6TTTACTA TCTCCCCGGC CAATTTCATA TCTAAAG6GA TCCTCGGGAG 31)40 
GCCCCCT6CT CTGTCCCATfi GGACATfiCCG TGGGCATTTT CAGGGCCQC6 GTGTGCACCC 3900 
GTGfifiGTCGC AAAGGC6GTC GACTTTflTGC CCGTTGAATC CCTA6AGACC ACCATGAGGT 3960 
CCCCAGrCTT TACC6ACAAT TCCA6CCCTC C6ACAGTGCC CCAGAGCTAC CAGGTGGCGC 4020 
ATCTGCflCGC TCCCACTGGA AGTGGTAAGA GCACGAA6GT GCCGGCCGCC TATGCGGCTC 4080 
AAGGGTACAA G6TTCTTGTG CTGAACCG6T CTGTTGCT6C CACCCTAGG6 TTCGGCGCTT 4140 
ATATGTCAAA GGCCCATGG6 ATTGACCCAA ACGTCAGQAC TGGCGTAAGG ACCAFTACCA 4200 
CAGGCTCCCC CATCACCCAC TCCACCTACG GCAAATTCCT GGCTGAC6GT GGGTGTTCAG 42S0 
GAGGTGCGTA TGACATCATA ATATGTGACG AAT6TCACTC AGTGGAC6CC ACCTC6ATTC 4320 
TAGGCATAfiG 6ACTGTCTTG GACCAAGCGG AGACA6CGGG GGTTAGGCTC ACTATCCTCO 43*0 
CCACC6CTAC ACCACCTGGC TCCGTCACCG fGCCACATTC CAACATCGA6 GAAGTTGCAT 4440 
TGTCCACT6A G6G6GAGATA CCAFTCTATG GTAAG6CCAT CCCCCTAAAT TACATCAAGG 4500 
GGGQGAfifiCA TCTCATTTTC TGTCATTCCA AGAAGAAGFG CGACGAGCTC GCTGCAAAGC 4560 
TGGTTGGCCT GGGCGTCAAC GCA6TGGCCT TTTACCGCGG CCTCGACGTG TCTGTCATCC 4620 
CAACCACAGG AGACGTCCrT GTTGTGGCGA CCGACGCCTI AAIGACTGGC TACACCG6CG 4630 
ATTICGAC7C CGTGATAGAC TGCAACACCT 6TGTCGTCCA GACAGTCGAT TFCAGCCTAG 4740 
ACCCTACATT CTCTATTGAG ACTTCCACCC T6CCCCAGGA CGCCGfGTCC CGCTCCCAAC 4800 
GG4GAGGTAG AACCGGTCGG GGGAAGCAT6 GTATCTACAG ATATGTGTCA CCCGGGGAGC 4860 
GGCC6TCTGG CATGTTT6AC TCCGTG6TCC TCTGTGAG16 CTATCACQCG GGTTGTGCTT 4920 
GGTATGAGCT TACACCCGCC GAGACCACGG ITA6GTTACG SGCAIATCTT AACACCCCAG 4980 
GGTTGCCCGT GT6CCAGGAC CACTTGGAGT TTTGGGAGAG CGTCTTCACC GGCCrCACCC 5040 
[0 0 3 1] 40 t!W!£ICfiJffl-?*S. *5eqgiC<fcO»f/tlca:3£Snft::t- 

=t<r>ik^t.m^v^)v<Dmm^\zmm-c'^^. sfcJi [0032] 
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J7 18 

m^^^ : 1 

WM<7)-^^ : 9 487 

mn^m : 

EJIlOa^ : c DNA to genomic RNA (HC-G9 cONA) 

GCCAGCCCCC TGATGeGG<5C GACACTCCGC CAFGAATCAC TCCCCTGTGA 66AACTACT6 60 
TCriCACGCA 6AAA6CGTCT AGCCATfifiCG TTA6TATGAG T6TCGTGCAG (XTCCAG6AC 120 
CCCCCCT(X;C GGGAGAGCf^ TAGT6GTCTS CG6AAC(»6T GAGTACACCG GAATT6CCAG 180 
6ACGACCG6G TCCTTTCTTG 6ATTAACCC6 CTCAAT6CCT GGAGATTTGG GC6TGCCCCC 240 
6CAAGACTGC TAGCCGAGTA GTGTTSGGTC GCGAAAG6CC TTGTGGTACT GCCTGATAGG 300 
GT6CTT6CGA GTGCCCCGG6 A6GTCTCGTA 6ACCGTGCAC CATGA6CACG AATCCTAAAC 360 
CTCAAAGAAA AACCAAACGT AACACCAACC GCCGCCCACA 66ACGTTAA6 ITCCCGGGTG 420 
GC6GC(JAGAT C6TTGGCGGA GTTTACTTGT TGCCGC6CAG GGGCCCCAGA GTGGGT6TGC 4S0 
6CGCGAC6AG 6AAGACTTCC GAGCGGTCGC AACCTC6CGG GAGGCGTCAG CCTATT(JCCA 540 
AGGCC(M(X;G ACCCGAGGGA AGGTCCTGGG CGCAGCCCGG GTA(X:CTrG6 CCCCTCTAT6 600 
GCAACSAGGG CTGTG66TG6 GCGG6ATGGC rCCTGT(iCCC CCGCGGCTCT CGGCCTAGTT 660 
6GGGCCCTTC TGACC(XCGG CGGAGGTCAC GCAATTTGGG TAAGGTCAFC GATACCCTCA 720 
CGTGTGGCTT CGCCGACCTC AT6GGGTACA TC(XGCTCGT C66CGCTCCT CTAG6GGGC6 780 
CTGCCA6A6C TCTG6CACAT 6GTGTTAGAG TCCTGGAA6A CG6C6TGAAT TACQCAACAG 840 
66AACCTCCC CGGTTGCTCT TTTTCTATCT TCTTGCTCGC TCTTCTATCC TQCCTGACA6 900 
TCCCTGCTTC GGCCGT(X5GA 6TGCGCAACT CTTCGGGGGT 6TACCATGTC ACCAATGAIT 960 
GCC(X:AArGC GTCC6TTGTG TACGAGACGG ACAACCTGAT CATGCATCTG (XCG6GTGTG 1020 
IGCCCTACGT ACGCGAG6GC AACGCCTC6A GGTGTTGGGT CTCCCFTAGT CCCACCGTAG 1080 
CCGCCAGGGA TTCGC6CGTC CCC6TCA6TG AGGfTCGGCG TCGTGFCeAC TCGA7TGTCG 1140 
GGfiCCGCTGC GTTC7GTTCG 6CTATGTAT6 TAGGGGACCT ATGCGGCTCC ATCTTCCITG 1200 
TTfiGCCAGAT CTTCACCTTC TCTCCCAG6C ACCATTGGAC 6ACGCAAGAC FGCAAirGCT 1260 
CCATCTACCC AG6CCATGTG ACAGGTCATC GAATGGCTTG GGACATGATG ATGAATTGGT 1320 
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CACCCACTGG CGCCTTAGT6 GT6GCACAGC TACTCCGGAT CCCACAAGCT ATCGTGGATA 1380 
TGATAGCTGG TGCCCACTGG GGTGTCCTA6 CGGGCCTGGC ATACTACTCC ATGGTGfiGGA 1440 
ACTG6GCTAA GGTTGT6G7C GTGCTGCT6C TCnCGCIGG C6TT6AC6CA GAGACCCGGG 1500 
TCACAGGGfiG SGCCGCTSGC CACACC6CGT TCGG6TTIGC TA6CTTCCTC GCCCCAGGCG 1560 
CTAAGCAAAA GATCCAGCTC ATAAATACCA ACGGCAGCTG GCACATCAAC AGAACTfiCCT 1620 
TGAACTGTAA TGAAAGCrTG 6ATACT6GCT GGCTA6CAQG GCTGCTCTAC TACCACAAGT 1M0 
TCAACTCCTC AG6GTGTCCC 6AGASGATG6 CTAGTTGCCA ACCTCnACC GCCT7CGACC 1740 
AAGQGTGG66 ACCCATCACT CACGAQGGGA ATQCTA6TGA TGACCAGCGG CCATATTGH 1300 
GfiCACTATGC CCTAC6CCCG TGT6GCATTG TSCCAGCfiAA AAASGniGC GGGCCT6TAT 1360 
ACTGTTTCAC ACCCAGCCCC GTGGTAGTGG GGACGACGGA CAeAGCCGGC GTTCCTACCT 1920 
ACAGATGGee TGCCAATGAG ACGGAT6TAC TGCTCCTCAA CAACTCTAGfi CCGCCAATGG 1980 
GGAAnCGTT TGGGTGTACG T6GATGAATT CTAGTGGCn CACCAAGAC6 TGCGGGGCTC 2040 
CG6CCTGCAA CATC6GCG6G AGCG6GAACA ATACCCTGCT GTGCCCAACA GATT6C7TCC 2100 
GTAAACATCC GGATfiCCACA TACA6CAGGT GCGGCTCTGG TCCCTG6CTT ACCCCTCGAT 2160 
GCTTGGTAGA CTACCCATAC AGGCTCTGGC ACTACCCCTG TACAGTCAAT TACACCATTT 2220 
TCAAGATCAG GATGniGTG GGC66GGTTG A6CACAG6CT TGAC6CCGCG TGCAACTGGA 2280 
CGCGGGGAGA GCGCT6CSAT TTG6ACGACA GQGATCGG6C CGAfiTTGAGC CCTCT6TTGC 2340 
TGTCCACTAC GCAAT6GCAG GTCCTCCCCT GCTCATTCAC AACACTQCCC GCCCTGTCAA 2400 
CTGfiCCTGAT ACATCTCCAC CAGAACATCG TGGACG7GCA GTACC7C7AT G6G77GAGCT 2460 
CGGCAG7CAC A7CC7GG67C A7AAAG7GGG AG7ACGT7eT GC7CCTCnc TTGCTGC7AG 2S20 
CAGArGC7CG CA777G7CCC 7GC77G7(SGA 7GArGCT7Cr CA7ArC7CA6 G7AGAQ(3CQQ 2S80 
CGC76GAGAA C77GA7AG7T C7CAAC6C76 C77CCC7A67 C6G6ACACA7 GGCATCG1CC 2640 
CCTTC77CA7 C777777767 GCAGC77GG7 ACCFAAAAGG CAAGT6GGCC CC7GGAC7CG 2700 
CCTATTCCGT CTATQGGA76 7GGCCAC7GC 7CC7GC77C7 CC7G6CG77G CCCCAACGGG 2760 
CA7ACGCC77 GGA7CAGCAG T7GGCC6CG7 C67G7G6CGC CACGG7CrrC A7CrGCC7AG 2820 
CGG7GC7CAC 7C7A7CGCCA 7A7TAaAAC AG7ACA7CGC CCGCGGCA7C 76Gra;C7GC 2880 
AG7ACATGC7 GACCAGA6CA GAGGC6C7CC 7ACAGGT77G GG7CCCCCCG C7CAACGCCC 2940 
GAGGA6GGCG C6ACGGAGTC G7AC7GCrCA CG7G7G7GC7 CCACCC6CAC TTGCTCn76 3000 
AAA7CACCAA GA7CA7GC7G GCCA77C7Ce GfiCCT7TGTG GA7C776CAG 6CCAGTC7GC 3060 
7CAAG6TACC G7AC77CGTG CG7GCCCAC6 G7C7CAnAG GC7C7GCATG C7G6rGCGCA 3120 
AGACA6CCGC CGG7CA67AT GrGCAGA7GG C7C7Gr7AAA GCTGGGAGCA 77TGCCQGCA 3180 
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CCTACAITTA CAACCACCTT TCCCCGCTCC AAGACTGGCC TCACAOCGGT CTGCGCGACC 3240 

TC6CG6TAGC CACTGAACCC 6TCATCTTCT CCCfiOATGCA AATCAAGACT ATCACCTGGG 3300 

GGGCG6ATAC TGCSGCTTGT 66A6ACATCA TCAACGGGCT GCCTGTTTCC GCCCGGAGAG 3360 

GGA6AGA6GT GTTGCT66GA CCAQCCGATG CCCT6ACTGA CAAAfiGATGG AGGCTinAG 3420 

CCCCCATCAC GGCTTACGCC CAGCAGACAC GGGGTCTC7T GGGCTGCATC ATCACCAGCC 3480 

TCACCGGTCG G6ACAAAAAT CAAGTGGAGG QGGAAGTCCfl 0ATTQT6TCT ACCGCAACCC 3540 

AGACGTTCTT GGCTACnST GTTAAT6QA6 niGCTGGAC TGTGTATCAT QGGGCCGGAT 3600 

CGAGGACCAT CGCTTCGfiCQ TCGGQCCCT6 TGATCCAGAT 6TACACTAAT GTG6ACCAGG 3660 

ATTTGGTGG6 CTGGCCAGCG CCTCAGGGAG CGCGCTCCCT 6AC6CC6TGC ACATGCGfiCG 3720 

CCTCQ6A7CT GTACHGGrC ACGAGGCACG CGGACGTCAF TCCAGTGCG6 CGTCG6GGCG 3740 

ATAACAGGGG AAGTTTACTA TCTCCCCGGC CAATTTCATA TCTAAAG6GA TCCTCGGGAG 3540 

6CCCCCTGCT CTGTCCCAT6 GGACAT6CC6 TGGGCATTTT CAGGGCCGCG 6TGTGCACCC 3900 

CTfifififiTCfiC AAAG6CGGTC GACTTTfiTGC CCGTTGAATC CCTAGAGACC ACCATGAGST 3960 

CCCCAGrGTT TACCGACAAT TCCAGCCCTC CGACAGTGCC CCAGAGCTAC CAGGTGGCfiC 4020 

ATCT6CACGC TCCCACTGGA AGTG6TAAGA GCACGAAGGT GCCG6CCGCC TAT6C6GCTC 4080 

AAGGGTACAA GGTTCTT6T6 CTGAACCCGT CTGTTGCTGC CACCCTAGGG TTCG6CGCTT 4140 

ATATGTCAAA GGCCCAT66G ATTGACCCAA ACGTCAGGAC TGGC6TAAGG ACCAFTACCA 4200 

CAGfiCTCCCC CATCACCCAC TCCACCTACG 6CAAATTCCT 6GCTGACGGT GGGTQTTCAG 4260 

GAGGTGCfiTA TGACATCAM AfATGTGACG AATGTCACTC AOrGGACGCC ACCTCGATTC 4320 

TAGGCATAGG GACT6TCTTG GACCAAGCGG A6ACAGCGGG GGriAGGCTC ACTATCCTCG 4380 

CCACCGCTAC ACCACCT6GC TCCGTCACCG TGCCACATTC CAACATCGAG GAA6TTGCAT 4440 

TGTCCACT6A GGGGGAGATA CCAFTCTATG GTAA6GCCAT CCCCCTAAAT TACATCAAGG 4500 

6GGGGAGGCA TCTCATTTTC TGTCATTCCA AGAAGAAGFG CGAC6AGCTC GCTGCAAA6C 4560 

TGGTTGGCCT GGGCGTCAAC GCA6TG6CCT TTTACCGCSG CCTCGACGTG TCTGTCATCC 4620 

CAACCACAGG AGACGTCCrT GTTGTGGCGA CCGACGCCTT AATGACTGGC TACACCGGCG 4680 

ATTTCGACTC CGTGATAGAC TGCAACACCT 6TGTCGTCCA GACA6TCGAT TrCAGCCTAG 4740 

ACCCTACATT CTCTATTGAG ACTTCCACCG TGCCCCAG6A CGCCGIGTCC CGCTCCCAAC 4800 

GGAGAGSTAG AACCGGTCGG GQGAAGCATG 6TATCTACAG ATATGTGTCA CCCGGGGAGC 4860 

GGCCSTCTGG CATGTTT6AC TCCGT66TCC TCTGTGAGIG CTATOACGCG GGTTGTGCTT 4920 

GGTATGAGCT TACACCCGCC GflGACCACGG TTAGGTTACG 6GCATATCTT AACACCCCAG 4980 

GGTTGCCCGT GTGCCAGGAC CACTTGGA6T TTTGGGA6AG CGTCTTCACC GCCCTCACCC S040 
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ACATAGATGC CCACTTCCTG TCTCAGACGA AACAGAGC6G GGAAAAHTC CCCTACCTAG 5100 
TC6CATACCA AGCCACCGTG T6CGCTA6AG CTAAAGCTCC TCCCCCGTCA TGGGACCAAA 5160 
TGTG6AA6T6 CTTGATACGG CTCAAGCCCA CCCTCACTGG 6GCTACCCCC CTACTATACA 5220 
GACTGQGTGG rGTGCAGAAT GAGATCACCC TAACACACCC AATCACCAAG TACATCATGG 5280 
CnGTATGTC GGCTGACCrG GAGGTCGTCA CTAGCACGTG GCTGCTGGTG G6CGGCGTCC 5340 
TGGCCGCTTT SGCCGCTTAC TGCCTSTCTA CAGGCAGC6T GGTCATAGTG 6GCAG6ATAA 5400 
TCCrAAGCGG 6AAG(X:GGCA GTCATTCCTG ACAGGGAGGT TCTCTACCGA fiAGTHGATG 5460 
AGATGGAAGA fiTGCGCCG(« (^CATCCCCT ACCHGAGCA GGGGATGCAT TTGGCT6AAC 5520 
AGTTCAAGCA GAAAQCTCTC GG6TTGCTCC AGACAGaTC CAAGCAAGCA fiAGA(XATCA 5580 
CTCCCQCTQT CCATACCAAT TQQCAflAAAC TCGAQTCCTT CTflOflCTAAQ CACATQTOGA 5640 
ACTTCGTCAG C6GGATACAA TACCTGGCGG GCCTGTCAAC GCTGCCCGGT AATCCCGCTA 5700 
TA6C6TC6CT GATGTC6TTT ACAGCCGCGG TGACGA6TCC AC1AACCACC CA6CAAACCC 5760 
TCCTCTTTAA CATCCTTGM QGGTG66TGG CC6CCCAGCT TGCCGCCCCA GCTGCCGCCA 5820 
CTGCTTTCGT CGGCGCTGGT ATTACCGGCG CTGTCATC6G CAGTGTGGGC CTAGGGAAGG 5880 
TCCTA6T6GA CATTCTTGCT GGCTAC6GGG CTG6TGTGGC GGGGGCCCTT GTGGCTTTCA 5940 
AGATCATGAG CGGGGACGCC CCCACC6CCG AGGATCTAGT CAACCTTCTG CCTGCCAICC 6000 
TCTCGCCAGG A6CTCTC0TT GTAG6CGTGG TGT6CGCAGC AATACTACGC CGGCACGTGG 6060 
GCCCTGGCGA GGGCGC6GTG CAGTGGATGA ACCGACTGAT AGCGTTTCM^T TCTCGGGGTA 6120 
ACCACGTCTC CCCTACACAC TATGTfiCCAG AGAGCGACfiC GTCA6TC(»T GTCACACATA 6180 
TCCTCACCAO CCTCACTOTC ACTCAGCTCC TGAAAAGGCT CCACGTGTGG ATAA6CTCA6 6240 
ATT6CAC(»C CCCGT6TGCT GGTTCnGGC TCAAAGATGT CTGG6ACT6G ATATGCGAG6 6300 
TfiCTGAGCGA CTTCAACAGT TGGCTfiAAQG CCAAACTTAT 6CCGCAACTG CCCGGGATCC 6360 
CATTCGTATC CTGTCAACGC 6GGTACCGT6 GGGTCTGGCG GGGCGAAGGC ATCAT6CAC6 5420 
CCCGTTGCCC 6TGTGGAGCC 6ATATAACTG GTCATGTCAA AAACGGTTCG ATGAGAATCG 5480 
TCGGCCCTAA GACTTGCA6C AACACCTGGC GTGGGTCGTT CCXCATCAAC GCCCACACTA 6540 
CGGGCCCTTG CACACCCTCC CCAGCGCCGA ACTACACGTT CGCGTTATGG A6GGTGTCGG 5600 
CAGAGGA6TA TGTGGAGGTA AGGCGGCrGG GGGATTTCCA TTACATCACG GGGGTGACCA 6660 
CTGATAAGAT CAAGTGTCCA FGCCAGGTCC CCTCGCCCGA GTTCTTCACA GA66TGGAIG 6720 
6CGTGCGCCI ACATAGGTAC GCCCC(^CCT GCAAACCCCT GCTACG6GAT GA6GTGACGT 6780 
TTAGCATCGG GCTCAATGAA TACTTGGTGG GGTCCCAGTT GCCCTGCGA6 CCCGA6CCA6 6840 
ACGTA6CTGT ACTGACATCA ATGCTTACAG ACCCCTCCCA CATCACTGCA GAGACGGCGG 6900 
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CGCGTAGACT GAATCGGGGfi TCTCCCCCCT CCCTGGCTAG CTCTTCTGCC AGCCAATTflT 6960 

CTGCGCCGTC CCTGAAAGCA ACATGTACCA CCCA(XATGA CTCTCCAGAC GCrGACCTCA 7020 

TAACAQCCAA CCTCCTGTGG AGGCAGGAGA TGGGtSGGGAA CATTAC(»GA GTGGAGTCG6 7080 

A6AATAAGAT CGTCATCCTG (y\TTCTTTCG AC(X;GCTCGr 6GCGGAGGAG GATGATCGGG 7140 

AGATTTCT6T CCCAGCT6AG AHCTGCTGA A6TCTAAGAA GTTTCCCCCC GCCATGCCTA 7200 

TATfiGGCACG 6C(^GATTAT AATCCTCCCC TTGTtKAACC ATG6AAGCGC CC6GACTACG 7260 

AACCACCCTT AGTT(»CGG6 TGCCCCCTAC CACCTCCCAA ACCAACTCC6 GTGC(X3CCAC 7320 

CCC66AGAAA GAGGACAGTG QTGCTGGATG AGTCTACAGT ATCATCTGCT CTGGCTGAGC 73flO 

TTGCCACTAA GACCTTTGGC AGCTCTACAA CCTCAGGCGT GACAAfiTGGT GAAGCGGCCG 7440 

AATCOrCCCC GGCGCCTTCC TGCGACGGTG AACTQGACTC CGAAGCTGAA TCTTACTCCT 7500 

CCATGCCCCC TCrCGAGGGG GMCCGGGGG ACCCCGATCT CAGC6ACGGG TCTTGGTCTA 7560 

CCGTAAGCAG T6ATGGCG6T ACGGAGGACG TCGTGTGCTG CTCGAT6TCC TACT(»]TGGA 7620 

CGGGCQCCCT AATTACGCCC TGTGCCGCAG AGGAAACCAA ACTCCCCATC AAC6(»CTGA 7660 

6TAACTCGCT GCTGCSCCAC CACAATTTGG TGTAnCCAC CACCTCT(MC AGC6CTGGCC 7740 

AGA6GCAGAA AAAA6TCACA TTT6ACAGGC TGCAGGTCCT GGACGATCAT TACCGGGACG 7600 

TGCTCAAGGA GGCTAA6GCC AAGGCATCGA CAGTGAAGGC TAAATTGCTA TCCGTAGAGG 7860 

AGGCAT6TAG CCTGACGCCC CC6CACTCCG CCA6ATCAAA ATTT6GCTAT GGGGCGAAGG 7920 

ATGTC(XIAAG CCATTCCA6T AAGGCCATAC GC<JACATTAA CTCCGTGT6G CAA6ACCTTC 7980 

T6GAG6ACAA TACAACACCT ATAGACACTA CCATCATGGC AAAGAATGAG GTCTTCTGCG 8040 

TGAAGCXCGA AAAGGGGGGC CGCAAGCCCG CT(^TCTTAT C6TGTACCCC GACCTGGGAG 8100 

TGCGCGTATG CGAGAAGAGG GCniGTATG ACGTAGTCAA ACAGCTCCCC ATTGCCGTGA 8160 

T6GGAACCTC CTACGGG7TC CAGTACTCAC CAGCGCAGCG GGT(MACTTC CTGCTTAAT6 8220 

CGTGGAAATC AAA6AAAAAT CCTATG6G6T TTTCCTATGA CAC(XiGTr6C TTT6ACTC6A 8280 

CAGTCACTGA 6GCT6ATATC CGTAC66AGG AAGA(;CTCTA C(;AATCTTGT GACCTGGTCC 8340 

CTGA66CCCG C6CG6CCA1A AGGTCTCTCA (^GAGAGGCT rTACATCGGG GGCCCGCTTA 8400 

CCAACTCTAA GGGACAAAAC rOCGGCTATC GGCGATGCCG CGCGAGCGGC GTGCTGACCA 8460 

CTAGCTGCfiG TAACACCA7A ACCTGTTArC TCAAGGCCAG TQCAGCCTGT CGA6CTGCAA 8520 

AGCTCCGGGA CTGCACTATG CTCGTGTGCG GCGACGACCT CGTCGTTATC TGTGAGAGCG 8580 

CCGGTGKXA GGAGGACGCT GCGAACCTGA GAGCCTTCAC GGAGGCTATG ACCAGGTACT 8640 

CCGCCCCCCC CGGAGACCCG CCTCAACCAG AATACGACTT GGAGCTTATA ACATCTTGTT 8700 

CCTCCAAIGT TTCA6TCGC6 CACGACGGCG CTGGCAAAAG GGTCTACTAT CTGACCCGTG 8760 
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ATCCTflAGAC TCCCCTCGCG CfiTGCCGCTT fi6GAiWCA(5C AAGACACACT CCAGTfiAACT 8820 
CCTGGCTAGG CAACATCATC ATGTTTGCCC CCACTCTGTG GGTACGGAfG GTCCTTAT6A 8880 
CCCATTTHT CTCCATACTC ATAGCCCAGG AACACCTT6A AAAfiGCTCTA 6ATTGTGAAA 8940 
TCrATGGGGC CGTGCACTCC GTCCAACCGT TAGATCTACC TGAAATCATT CAAAGACfCC 9000 
ACGGCCTCA6 CGCGTTCTCG CTCCATA6TT ACTCTCCA6G TGAAATCAAT AGSGTG6CTG 9060 
CATGCCTCA6 6AAACTT6GG GTTCCGCCCT T6CGA6CTTG GA6ACACCGG GCCCGGAGCG 9120 
TCCGCGCCAC ACTCCTATCC CAGGGGGGGA GA6CCGCTAT AT6CG6TAAG TATCTCTTCA 9180 
ACTGGGCSGT GAAAACCAAA CTCAAACTCA CK^^UtC 6TCCGCGTCT (^IGTTGGACT 9240 
T6T(mnG 6TTCAC66GC GGTTACA6C6 GGGGAGACAT TTA7CACA6C GTGTCTCATG 9300 
TCCGGCC(X)6 CTGGTTCTTC TGGTGCCTAC TCCTACTTTC AGTGGG6GTA GGCATTTACC 9360 
TCCTTCCCAA CCGGTAGACG GTTGGQCflAC CACTCCA66C CTTTAGGCCC TSTTTAAACA 9420 
CTCCAQGCCT TTAGQCCCCG TTTTTHnT mTTTTTTT TTTTTTTTTT TTTTTTTTTT 9480 
TTTTTTT 9487 
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Mim^ : 2 
E^J^^S : 1 7 6 5 
iE^ii^^M : 

EM«r)^:cDNA to genomic RNA (YS117-S' CONAI 



CCAT6GCGTT AGTAT6AGTG TCGTGCAGCC TCCAGGACCC CCCCTCCC66 GA6AGCCATA &0 
GTGGTCTGCG GAACCGGTGA GTACACCGGA ATTGCCAGGA CGACCGGGTC CTTTCTTGGA 120 
TCAACCC6CT CAATGCCTG6 AGATTT6G6C GTGCCCCCGC AAGACTGCTA GCCGAGTAGT 180 
GTTGGGTC6C GAAAGGCCTT GT6GTACTGC CTGATAGGGT GCTTGCGAGT GCCCC6GGAG 240 
GTCTCGTAGA CCGTGCACCA TGAGCACAAA TCCfAAACCT CAAAGAAAAA CCAAACGTAA 300 
CACCAACCQC CGCCCACAGG AC6TTAAGTT CCCGQGTGGC GGCCA6ATCG nGGCQGAGT 360 
TTACTTGTTG CCGCGCAGGG GCCCCAGA6T GGGTGTGCGC 6CGACGAGGA AGACnCCGA 420 
GCGQTCGCAA CCTCGC6GAA 6GCGTCAGCC TATCCCCAAG GCCCGCCGAC CCGAGGGTAG 480 
GTCCTG6GCG CAGCCCG6GT ACCCTTGGCC CCTCTATGGC AACGAGGGCT GCGGGTGGGC 540 
G6GAT6GCTC CT6TCCCCCC fiCGGCTCTCG GCCTAGTTGG 6GCCCCACT6 ACCCCC6GCG 600 
GAGGTCACGC AAHTGGGTA AGGTCATCGA TACTCTCACG TGTG6CTTCG CCGACCTCAT 660 
GfiGQTACATC CCGCTCGTCG 6TGCTCCTCT AGG6GGC6CT GCCA6A6CTC rGGCACACGG 720 
T6TTAGAGTT CT6GAAGACG GCGTGAACTA CCCAACAG6G AACCTTCCT6 GTTGCTCCTT 780 
TTCfATCTTC TTGCTCGCTC TTCTATCCTG CCTGACAGTC CCTGCTTCGG CC6TCGAAGT 840 
GCGCAACTCA TCAGGGGTGT ACCATGTaC CAATGATTGC CCCAATGCGT CCGTTGTGTA 900 
CGAGACA6AG AGCCTGATCA TGCATCT6CC CGGGTGTGT6 CCCTGCGTAC GCGAGGGCAA 960 
CGCCTCGAGG TGCTG6GTCT CCCTTAGCCC TACCATTGCC GCTAAGGATC CGAGCGTCCC 1020 
CGTCA6TGAC ArTCGGCGTC ACGTCGACTT GATCGTCGGG GCCGCCGCGT TCTGTTCGGC 1080 
TATGFATGTA GGGGACCTAT GCGGCTCCAT CTTCCTCGTT GGCCA6ATT7 TCACCTTCTC 1140 
TCCCAGGCGC CATTGGACGA CGCAG6ACTG FAATrGCTCC ATCTACCCGG GCCATGTGAC 1200 
AGGTCATCGA ATGGCTTGGG ACATGATGAT GAATTGGTCA CCCACTGGCG CCCTAGTGAT 1260 

GGCGCAGCTA CTCCGGATCC CACAAGCTGT CGTGGATATG ATA6CCGGTG CCCACTGGGG 1320 
TGTCCTAGCG GGCCTGGCAT ACTACTCCAT GGTGG6GAAC TGGGCTAAGG TTGTGGTTGT 1380 
GCTGTTGCTC nCGCTGGCG TCGACGCGGA CACCCAGGTC ACAGGAGGCA GCGCTGCCTA 1440 
7GATGCGC6C GGACTTGCH CCCTTTTCAC CCCAGGCCCT AA6CAAAACA TCCAGCTCAT 1500 
AAATACCAAC GGaGCTGGC ACATCAACAG GACCGCCTT6 AACTGTAATG AAA6CCTGAA 1560 
CACCGQCTGG GTA6CAGGCC TGTTCTACTA TCACAAATTC AACTCCTCGG GGTGTCCTGA 1620 
GAGSATGGCT A6TTGCCAGC CCCTCACC6C CTTTGACCAA 66GT6GGGAC CCATCACTTA 1680 
CGAGGGGAAT GCTAGC6GCG ACCAACGGCC ATATTGCT6G CACTATGCCC CACGCCCGTG 1740 
CGGTAHGIG CCGGCGAGAG AGGTT 1765 
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32 



CO 0 34] 



E^#-t : 3 
W9i<7y^^ : 1 1 9 1 

S^<^^i:cDNA to genomic RNA (YS117-3' cDMA) 

AACAGTCACT 6AGGCTCATA TCCGTACGGA GQAAGACCTC TACCAATCn GTGACCTGGT 60 

CCCCGAGGCC CGCACGGCCA TAA6GTCTCT CACA6AGA6G CTTTACATCG GG66CCCCCT 120 

TACCAATTCC AA66GACAAA ACTGCGGCTA TCGGC6ATGC C6TGCAA6C6 GCGT6CTGAC 180 

CACTAGCTGC G6TAACACCA TAACCTGHA TCTCAAGGCC AGCGCA6CCT 6TCGAGCTGC 240 

AAAGCTCCAG GACTGCACCA T6CTCGTGTG C6GCGACGAC CTCGTCGTfA TCT6TGAGAG 300 

CGCC6GTGTC CA66AGGACG CTGCGAGCCT 6AGAGCCTTC ACGGA6GCTA TGACCAGGTA 360 

CTCCGCCCCC CCG6GAGACC CGCCTCAACC AGAATACGAC TTGGAGCTCA TAACATCCTG 420 

TTCCTCCAAC GTGTCAG7CS CGCACGACGG CTCTGGCAAA AGGGTCTACT ATCTGACCC6 A&Q 

TGATCCT6AG ACTCCCCTCG CGC6TGCCGC TTGG6A6ACA GCAAGACACA CTCCAGT6AA 540 

CTCCT6GCTA GGCAACATCA rCATGITTGC CCCCACTCTG TG6GTACG6A T66TTCTTAT 600 

GACCCATTTT TTTTCCATAC FCATAGCaA 66AGCACCTT GAAAAGGCTC TAGATTGTGA 660 

AATCTATGGA GCCGTACACT CCGTCCAACC 6CTGGACCTA CCTGAAATCA TTCAAA6ACT 720 

CCACG6CCTC AGCGCGTTTT C6CTCCACAG TTACTCTCCA 6GTGAAATCA ATAGGGTG6C 780 

TGCATGCCTC AGAAAACTTG GGGTTCCGCC CTTGCGAGCT TGGAGACACC GGGCCCGGAG 840 

CGTCCGCSCC ACACTCCTAT CCCAG6GG6G GAAAGCTGCT ATATGCGGTA AGTACCTCTT 900 

CAACT6GGC6 GTGAAAACCA AACTCAAACT CACTCCATTA CCGTCCGCG7 CTCAGTTGGA 960 
CTTGTCCAAT TGGTTCACGG 6CGGCTACAG CGG6GGAGAC AHTATCACA GCGTGTCTCA 1020 
TGTCCG6CCC CGTTGGTrCT TCT6GT6CCT ACTCCTACTT TCAGTG6GGG FAGGCATCTA 1080 
TCTCCnCCC AACCGATAGA CGGTTGGGCA ATCACTCCTA 6CCTTTAGGC CTTATTTAAA 1140 
CACTCCAG6C CTTTAGGCCC TGTTTTTni TTTTTTTTTT TTTTTTTTTT T 1191 



[0 0 3 5] 



i|$M¥6-3 1 9 5 6 3 



33 



34 



eyii<73^$ : 1 7 6 5 

w.m^m : 

E^lf^^SS: cDNA to senomic RNA (SR037-S' cOMA) 

CCATGGCGrr ACTATGAGTG TCGTGCAGCC TCCAGGACCC CCCCTCCCGG GAGA6CCATA 60 

GTGGTCTGCG GAACCG6TGA 6TACACCQGA ATTGCCA6GA CGACCGGfiTC CTTTCrTGGA 120 

TTAACCC6CT CAAT6CCT6G AGATTTGGGC 6TGCCCCCGC AA6ACTGCTA 6CCGA6TAGT 1S0 

GTTGGGTCGC GAAA6GCCTT GTGGTACFGC CFGATAGGGT GCTTGCGAGT QCCCC6GGAG 240 

6TCTCGTA6A CC6TGCACCA T6AGCACGAA TCCTAAACCT CAAAGAAAAA CCAAACGTAA 300 

CACCAACCSC CGCCCACA66 ACGTCAAGTT CCC6GGTG6C GGCCAGATC6 nCGCGGAGT 360 

TTACTTGTT6 CC6C6CA6GG GCCCCAGAAT 6GGTGTGCGC GCGACGAG6A A6ACTTCCGA 420 

GCG6TCGCAA CCTCGCGGAA GGCGTCAGCC TAHCCCAAG GCCC6CC6AC CC6AGG6TAG 480 

6TCCT6G6CG CAGCCCGGGT ACCCTTGGCC CCTCTATGGT AAC6AGGGCT GTGGGT6GGC 540 

GGGATGGC7T CTGTCCCCCC GC6GT7CCCG GCCTAGTTGG GGCCCCTCTG ACCCCCGGCG 600 

GAGGTCACGC AACTTGGGTA AGGTCATCGA TACCCTCACG TG7GGCTTCG CCGACCTCAT 660 

G6GGTACATC CCGC7CGTCG G7GC7CC777 AG6GGGCGC7 GCCA6AGC7C 7GGCGCA7GG r20 

7G7CA6AQ77 C7GGAA6ACG GC67GAA77ft 7GCAACAGGG AACC7ICCCG GT7GC7C777 780 

77CTA7CnC 7rGC77GCCC 77C7A7CC7G CC7GACAG7C CC7GCrTCGG CCGTCGGA67 840 

GCGCAAC7C7 7CGGGGG7G7 ACCA7G7CAC CAA7GA77GC CCCAA7GCG7 CTG7TG7GrA 900 

CGAGACA6AG AGCC7GA7CA 7ACA7C7GCC CGGG7G7G7G CCC7GCG7AC GCGAGGGCAA 980 

CGCC7CGA6G 76C7GGGrC7 CCC77AG7CC 7AC7G77GCC GC7AAGGA7C CGAGCG7CCC 1020 

CG7CAG7GAG A7rCGACGCC A7G7C6ACC7 6A77G7C6GG GCCGCTGCG7 7CT677CCGC 1080 

7ATG7ACG7A 6G6GACC7A7 GCGGC7CCA7 C77CC7CG77 G6CCAGAT77 TCACCC7CTC lUO 

7CCCAGGCG7 CAC7GGACGA CGCAG6AC7G TAAT7Q77CC A7C7ACCCAG GCCATQ7GAC 1200 

AGG7CA7CfiA A7GGC77GGG ACA76ArGA7 GAAT7GG7CA CC7AC7GGCG CCC7AG7G6T 1260 

GGCGCAGC7A C7CCGGA7CC CACAAGC7G7 CG7GGA7ATG ATAGCCGG7G CCCAC7GGGG 1320 

7G7CC7AGCG GGCC7GGCA7 AC7A77CCA7 6G7GGGGAAC 7G6GC7AAGG 77G76GT7G7 1380 
GC7GC7AC77 777GC7G6CG 7CGA7GCAGA GACCCAGG7C 7CAQGAGGC7 CCGC7GCCCA 1440 

AACCACG7AC GG7C7TAC7G CCC7C77CAG GACAGGCCC7 AA7CAAAAAA TCCAGCTCAT 1500 

AAA7ACCAAC GGCAGC7GGC A7A7CAACA6 GACCGCC7TG AAC7G7AA7G AGAGCT7GCA 1560 

CACCGGC7GG C7GGCAGCGC 7G77C7ACAC CCACAAG77C AAC7C77CGG GG7G7776GA 1620 

GAGGA7G6CC AG77GCCAGC C7C777CCGC C77CGACCAA GGG7GGGGAC CCA7CAC77A 1680 

CGGGGGGAA7 GC7AGCGACG 6CCAACGGCC A7AT7GC7GG CAC7A7GCCC CACGCCCGTG 1740 

CGG7AnG7G CCGGCGA6AG AGGn 1765 
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(19) #S¥6-3 1 9 563 

35 36 

[0036] 

EJf^:^? : 1 1 7 9 

mM<rim. : ^Br 

: 

E?J<^S^:cDNA to genomic RNA (SR037-3' cOHA) 

AACA6TCACT GAGGCTGATA TIX^GCACGGA GGAA6ACCTC TACCAATCTT GTGACCTGGT 60 
CCCT6AC6CC CGCAC66CCA TAAGGTCCCT CACA6AGA66 CTTTACATC6 G6GGCCCGCT 120 
TACCAATTCr AA6G6ACAAA ACTGCGGCTA TC66CGATGC CGCGCAAGCG 8CGT6CTGAC 180 
CACTAGCTGC GGTAACACCA TAACCT6TTA TCTCAAGGCC AGTGCAGCCT GTCGA6CTGC 240 
AAAGCTCC6G GACTQCACTA T6CTC6TGTG CG6CGATGAC CTTGTCGTTA TCTCTGAGAG 300 
CGCCGGTGTC CAGGAGGACG CTGCGAGCCT GAGAGCCTTC ACGGAGGCTA TGACCAGGTA 360 
CTCTGCCCCC CC66GAGACC CGCCTCAACC AGAATACGAC TT6GAGCTTA TAACAT(^TG 420 
TTCCTCCAAT GTGTCAGTCG CGCACQACG6 CGC7GGCAAA AGGGTCTACT ATCTGACCCG 480 
TGATCCTGAG ACCCCCCTCG CGCGIGCCGC TTGGGAGACA GCAAGACACA CTCCAGTGAA 540 
CTCCT6GCTA GGCAACA7CA TTATGTTTGC CCCCACTTT6 rOGGTACGGA TCGFCCTCAT 500 
6ACCCATTTT TFCTCCATAC TCAFAGCCCA C6AGCACCTT GAAAAGGCTC 7AGATT6TGA 660 
AATCTATGGA GCCGTACACT CCATCCAACC 6CTGGACCTA CCTGAAATCA TTCAAA6ACT 720 
CCACGGCCTC AGC6CGTTTT CGCTC{iA(iAG TTACTCTCCA 6GTGAAATCA ATAGGGTGGC 780 
TGCATGCCrC AGAAAACTTG GGGTTCXGCC CTTGCGAGCT TGGAGACACC GGGCCCG6AG 840 
CGTCCGC6CC ACACTCCTAT CCCAGGGG6G GAAAGCCGCT ATATGCGGTA AGTACCTCTT 900 
CAACTGGGCG GTGAAAACCA AACTCAAACT CACrCCAFTA CCGTCCGCGF CTCAGTTGGA 960 
CTFGTCCAAr 7G6TTCACGG 6C6GC7ACAA CG6GGGAGAC A77TA7CACA GC67G7C7C6 1020 
7GTCC66CCC CG776677C7 rC76G7GCC7 AC7CC7ACrC 7CAG7GGGGG TAGGCATCFA 10«0 
7CTCC77CCC AACCGA7AQA CG677G667A A7CAC7CCAA GCC777AGGC CC77TT7AAA 1140 
CACrCCAGGC C7777GGCCC 7G77777T77 777717777 1179 



[0 0 3 7] fiB?l|S^: 6 ®©» : -2 

mn<r>&-i : 2 0 40 h^sO'J- : UB^ 

E5IJ®S:«K E?(l©ffl!S: cDNA to genomic RNA 



hijinx- : m.m^ E5>J (#18 6) 

cDNA to genomic RNA AYGTACCCCA YGAGRTCGGC 20 



mk^^htc:^& : E (YttTS;t(±C » R^GSfcttA) 

ffi?rj(#3 2 1) [0 0 3 9] JkiJiJ*^^: 8 

AACCTCCGCC GGGGATCAGA 2 0 E?JC0:g$ : 2 0 

[0 0 3 8] mm^^ : 7 Mm<Dm m& 

gH?B®SS : 2 0 moiSc : -*® 
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37 

£3^<oaS:cDNA to genomic RNA 

^m&\.ifn^ : E 

§im (#2 56) 

CGCGCGMCNA GGAARRCTTC 2 0 
(MttASfcttC . NttA, T. CS&ttG . Rtt 
A*fcttG) 

[0 0 4 0] mrnmn: 9 

E5>J©fia : 2 0 
iBJ^icOS! : mi 
m<DS. : -*® 

BB5»)®aS: cDNA to genomic RNA 

<tm&^vit:^m ■ E 

lEJfJ (# 1 0 4) 

AGRAARRCTT CSGAGCGRTC 2 0 
(RttGa:fe:tiA . SttCS&aO . ) 

[0 0 4 1] mmn: \ o 

E?B®S$ : 2 0 

^oa : 

h:i}?nv- : mm!. 

E?J®aS:cDNA to genomic RNA 
mL^9i'&L1f)3^ : E 
jEJfrj (#13 2) 

YRCCTTGGGC ATAGGCTGAC 20 

(YliTS&ttC , RttGS&ttA . ) 

[0 0 4 2] E^iJS^: 1 1 
EJlJWgS : 2 0 

&m(r>m : «S 

«®& : 

E?J©aS:cDNA to genomic RNA 
ffiJlJ (#13 3) 



# 

(20) 1$M¥6-3 1 9 56 3 

38 

GARCCAWCCT GCCCAYCCYA 20 
(RtiGSfcttA , WttT*fcf±A o YttCSfctt 
T » ) 

[0 0 4 3] E^J#^: 1 2 
E5tJ(0fi$ : 2 0 

m<D& : 

: cDNA to genomic RNA 

10 ^m^mmLfziimiE 

(#1 3 4) 

CCAARAGGGA CGGGARCCTC 2 0 

(RfiG^:tttA o ) 

[0 0 4 4] S?tJ#^: 13 
E^Og^ : 2 0 

E3f»JoaS : cDNA to genomic RNA 
20 !^^^^Lfc^^:E 
E^J (#13 5) 

RCCYTCGTTT CCRTACAGRG 20 

[0 0 4 5] E^]#^: 14 
m^KDS:-^ : 2 0 

EJOCOaiS : cDNA to genomic RNA 
30 mL^^^LTzyim: E 
E??ij (#2 9 6) 

GGATAGGCTG ACGTCTACCT 20 
[0 0 4 6] 



1$08^6-3 1 9 5 6 3 



39 



40 



mm^ : 1 5 

WM(n-^^ : 3 0 1 1 

iE^i<^S^ ■-W^'S. ( HC-Gd aiB i no ac t d ) 

Met Ser Thr Asn Pro Lys Pro Gin Arg Lys Thr Lys Arg Asn Thr 
5 10 IS 

Asn Ai^ Arg Pro Gin Asp Vai Lys Phe Pro Gly Gly Gly Gin He 
20 25 30 

Val Gly Gly Val Tyr Leu Leu Pro Arg Arg Gly Pro Arg Val Giy 
35 40 45 

Val Arg Ala Thr Arg Lys Thr Ser Glu Arg Ser Gin Pro Arg Gly 
50 55 60 

Arg Arg Gin Pro He Pro Lys Ala Arg Arg Pro Glu Gly Arg ser 
65 70 75 

Trp Ala Gin Pro Gly Tyr Pro Trp Pro Leu Typ Gly Asn Glu Gly 
SO S5 90 

Cys Gly Trp Ala Gly Trp Leu Leu Ser Pro Arg Gly ser Arg Pro 
95 10O 105 

Ser Trp Gly Pro Scr Asp Pro Arg Arg Arg Ser Arg Asn Leu Gly 
110 115 120 

Lys Val He Asp Thr Leu Thr Cys Gly Phe Ala Asp Leu Het Gly 
125 130 135 

Tyr He Pro Leu Val Gly Ala Pro Leu Gly Gly Ala Ala Arg Ala 
140 145 150 

Leu Ala His Gly Val Arg Val Leu Glu Asp Gly Val Asn Tyr Ala 
155 160 165 

Thr Gly Asn Leu Pro Gly Cys Ser Phe Ser He Phe Leu Leu Ala 
170 175 180 

Leu Leu Ser Cys Leu Thr Val Pro Ala Ser Ala Val Gly Val Arg 
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(22) i|$§B¥6-3 1 9 5 6 3 

41 42 

185 190 195 

Asn Ser Ser Q!y Val Tyr His Val Tttr Asn Asp Cys Pro Asn Ala 

200 205 210' 

Ser Val Val Tyr Glu Thr Giu Asn Leu He Het His Leu Pro Gly 

215 220 225 

Cys Val Pro Tyr Vai Arg Giu Gly Asn Ala Ser Arg cys Trp Val 

230 235 240 

Ser Leu Ser Pro Thr Val Ala Ala Arg Asp Ser Arg Vai Pro Val 

235 240 245 

Ser Glu Val Arg Arg Arg Val Asp Ser He Vai Gly Ala Ala Ala 

250 255 260 

Phe Cys Ser Ala Hec Tyr Val Gly Asp Leu Cys Gly Ser lie Phe 

265 270 275 

Leu Val Gly Gin lie P tie Thr Phe Ser Pro Arg His His Trp Thr 

280 235 290 

Thr Gin Asp Cys Asn Cys Ser lie Tyr Pro Gly His Val Thr Gly 

295 300 305 

His Arg Hfet Ala Trp Asp Met Het Het Asti Trp Ser Pro Thr Gly 

315 320 325 

Ala Leu Val Val Ala Gin Leu Leu Arg lie Pro Gin Ala lie Val 

330 335 340 

Asp Het lie Ala Gly Ala His Trp GIv Vat Leu Ata Gly Leu Ala 

345 350 355 
Tyr Tyr Ser Het Val Gly Asn Trp Ala Lys Val Val Val val leu 

360 365 370 

Leu Leu Phe Ala Gly Val Asp Ala Gtu Thr Arg Vai Thr Gly Gly 

375 380 385 

Ala Ala Gly His Thr Ala Phe Gly Phe Ala ser Phe Leu Ala Pro 

390 395 . 400 

Gly Ala Lys Gin Lys He Gin Leu He Asn Thr Asn Gfy Ser Trp 

405 410 415 
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(23) it$M¥6-3 1 9 5 6 3 

43 44 

His He Asfl Arg Thr Ala Leu Asn Cys Asn Giu Ser Leu Asp Tlir 

420 425 430 

Gly Trp Leu Ala Gly Leu Leu Tyr Tyr His Lys Phe Asn Ser Ser 

435 440 445 

Gly Cys Pro 6lu Arg Het Ala Ser Cys Gin Pro Leu Thr Ala Phe 

450 455 460 

Asp Gin Gly rrp Gly Pro lie Thr His Qlu Gly Asn Ala Ser Asd 

465 470 475 

Asp Girt Arg Pro Tyr Cys Trp His Tyr Ala Leu Arg Pro Cys Gly 

460 465 490 

He val Pro Ala Lys Lys Val Cys Gly Pro Val Tyr Cys Phe Thr 

495 500 505 

Pro Ser Pro Val Val Val Gly Thr Thr Asp Arg Ala Gly Val Pro 

510 515 520 

Thr Tyr Arg Trp Gly Ala Asn Glu Thr Asp Val Leu Leu Leu Asn 

525 530 535 

Asn Ser Arg Pro Pro Het Gly Asn Trp Phe Gly Cys Thr Trp Hec 

540 545 550 

Asn Ser Ser Gly Phe Thr Lys Thr Cvs Gly Ala Pro Ala Cys Asn 

555 560 575 

He Gly GIv Ser Gly Asn Asn Thr Leu Leu Cys Pro Tlir Asp Cys 

570 5 75 5 80 

Phe Arg Lys His Pro Asp Ala Thr Tyr Ser Arg Cys Gly Ser Gly 

585 590 595 

Pro Trp Leu Thr Pro Arg Cys Leu Val Asp Tyr Pro Tyr Arg Leu 

600 605 610 

Trp His Tyr Pro Cys Thr Val Asn Tyr Thr He Phe Lys He Arg 

615 620 625 

Hec Phe Val Gly Gly Val Glu His Arg Leu Asp Ala Ala Cys Asn 

630 635 640 

Trp Thr Arg Gly Glu Arg Cys Asp Leu Asp Asp Arg Asp Arg Ala 
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45 46 

650 655 660 

aiu Leu Sep Pro Leu Leu Leu Ser Thr Thr Gin Trp Gin Val Leu 

665 670 675 

f^po Cys Ser (>he Thr Thr Leu Pro Ala Leu Ser Thr fily Leu Ife 

6«0 685 630 

His Leu His Gin Asn He Val AsD Val Gin Tyr Leu Tyr Gly Leu 

6^ TOO 705 

Ser Ser Ala Val Thr Ser Trp Val lie Lys Trj m Tyr Val Val 

715 720 
Leu Leu Phe Leu Leu Leu Ala Asp Ala Arg ite Cys Ala cys Leu 

725 730 
Trp Met Het Leu Leu He Ser Gin Val Glu Ala Ala Leu Glu Asn 
no 745 750 

Leu lie val Leu Asn Ala Ala Ser Leu Val Gly Thr His Gly He 
755 760 765 

Val Pro Phe Phe He Phe Phe Cys Ala Ala Trp Tyr Leu Lys Gly 

r75 780 
Lys Trp Ala Pro Gly Leu Ala Tyr Ser Val Tyr Gly Het Trp Pro 
785 790 795 

Leu Leu Leu Leu Leu Leu Ala Leu Pro Gin Arg Ala Tyr Ala Leu 
«00 505 810 

Asp Gin Glu Leu Ala Ala Ser Cys Gly Ala Thr Val Phe He Cys 

815 820 
Leu Ala Val Leu Thr Leu Ser Pro Tyr Tyr Lys Gin Tyr Met Ala 
"0 835 440 

Arg Gly He Tro Trp Leu Gin Tyr Het Leu Thr Arg Ala Glu Ala 

850 855 
Leu Leu Gin Val Trp Val ProP ro Leu Asn Ala Arg Gly Gly Arg 
860 865 a 70 

ASP Gly Val Val Leu Leu Thr Cys Val Leu His Pro His Leu Leu 
885 890 
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(25) i}$M¥6-3 1 9 5 6 3 

47 48 

Phe Glu He Thr Lys He Met leu Ala He Leu Gly Pro Leu Trp 

900 S05 
He Leu Gin Ala Ser Leu leu Lys Val Pro Tyr Phe Val Arg Ala 
915 92a 925 

His Gly Leu He Arg Leu Cys Met leu Val Arg Lys Thr Ala 6ly 
930 935 940 

Gly Gin Tyr Va IGIn Met Ala Leu Leu Lys Leu Gly Ala Phe Ala 
945 950 .955 

Gly Thr Tyr He Tyr Asn His Leu Ser Pro Leo Sin Asp Tro Ala 
960 965 970 

His Ser Gly Leu Arg Asd Leu Ala Val Ala Thr Glu Pro Val He 
975 9S0 985 

Phe Ser Arg Met Glu He Lys Thr He Thr Trp Gly Ala Asp Thr 
990 995 ifloa 

Ala Ala Cys Gly Asp He He Asn Gly leu Pro Val Ser Ala Arg 
1005 1010 1015 

Arg Gly Arg Glu Val Leu Leu Gly Pro Ala Asp Ala Leu Thr Asp 
1020 1025 103O 

Lys Gly Trp Arg Leu Leu Ala Pro He Thr Ala Tyr Ala Gin Gin 
1035 1040 1045 

Thr Arg Gly Leu Leu Gly Cys He He Thr Ser leu Thr Gly Arg 
1050 1055 1060 

Asp Lys Asft Gin Val Gl uGly Glu Val Gin He Val Ser Thr Ala 
1065 1070 1075 

Thr Gin Thr Phe Leu Ala Thr Cys Val Asn Gly Val Cys Trp Thr 
1080 1085 1090 

Val Tyr His Gly Ala Gly Ser Arg Thr He Ala Ser Ala Ser Gly 
1095 1100 1105 

Pro Val He Gin Met Tyr Thr Asn Val Asp Gin Asp Leu Val Gly 
1110 HIS 1120 

Trp Pro Ala Pro Gin Gly Ala Arg ser Leu Thr Pro Cys Thr Cys 
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49 50 

1025 1030 1035 

Gly Ala Ser A&o Leu Tyr Leu Val Thr Arg His Ala Asp \ral He 

1040 1045 1050 

Pro Val Arg Arg Arg Gly Asp Asn Arg Gly Ser Leu Leu Ser Pro 

1055 1060 1065 

Arg Pro He Ser Tyr Leu Lys Gly Ser Scr Gly Gly Pro Leu Leu 

1070 1075 1080 

Cys Pro Het Gly His Ala Val Gly He Phe Arg Ala Ala Val Cys 

1085 1090 1095 

Ttir Arg Gly val Ala Lys Ala Val Asp Phe Val Pro Val Glu Ser 

1100 1105 1110 

leu Glu Thr Thr Hel Arg Ser Pro Val Phe Thr Asp Asn Ser Ser 

1115 1120 1125 

Pro Pro Thr Val Pro Gin Ser Tyr Gin Val Ala His Leu His Ala 

1130 1135 1140 
Pro Thr Gly Ser Gly Lys Ser Thr Lys Val Pro Ala Ala Tyr Ala 

1145 1150 1155 

Ala Gin Gly Tyr Lys Val Leu Val Leu Asn Pro Ser Val Ala Ala 

1160 1165 1170 

Thr Leu Gly Phe Gly Ala Tyr Mc[ Ser Lys Ala His Gly He Asp 

1175 1130 lias 

Pro Asn Val Arg Thr Gly Val Arg Thr He Thr Thr Gly Ser Pro 

1190 1195 1200 

He Thr His Ser Thr Tyr Gly Lys Phe Leu Ala Asp Gly Gly Cys 

1205 1210 1215 

Ser Gly Gly Ala Tyr Asp He He He Cys Asp Glu Cys His Ser 

1225 1230 1235 

Val Asp Ala Thr Ser Me Leu Gly He Gly Thr Val Leu Asp Gin 

1240 1245 1250 

Ala Glu Thr Ala Gly Val Arg Leu Thr He Leu Ala Thr Ala Thr 

1255 1260 1265 



—814— 



# 



«pffl¥6 - 3 1 9 5 6 3 



51 



52 



Pro Pro &lv Ser Val Thr Val Pro His Ser Asn He Glu Glu Val 

1270 1275 1280 

Ala Leu Ser Thr Glu Gly Glu lie Pro Phe Tyr Gly Lys Ala He 

1385 1290 1295 

Pro Leu Asn Tyr He Lys Gly Gly Arg His Leu He Phe Cys His 

1300 1305 1310 

Ser Lys Lys Lys Cys Asp Glu Leu Ala Ala Lys Leu Val Gly Leti 

1315 1320 1325 

Gly Val Asn Ala Val Ala Ptie Tyr Arg Gly Leu Asp Val Ser Val 

1330 1335 1340 

He Pro Thr Thr Gly Asp Val Val Val Val Ala Thr Asp Ala Leu 

1345 1350 1355 

ec Thr Gly Tyr Thr Gly Asp Phe Asp Ser Val He Asp Cys Asn 

1360 136S 1370 

Thr Cys Val Val Gin Thr Val Asp Phe Ser Leu Asp Pro Thr Phe 

1375 1380 1385 

Ser He Glu Thr Ser Thr Val Pro Gin Asp Ala Val Ser Arg Ser 

1390 1395 1400 

Gin Arg Arg Gly Arg Thr Gly Arg Gly Lys His Gly He Tyr Arg 

1405 1410 1415 

Tyr Val Ser Pro Gly Glu Arg Pro Ser Gly Met Phe Asp Ser Val 

1420 1525 1430 

Val Leu Cys Glu CysTyr Asp Ala Gly Cys Ala Trp Tyr Glu Leu 

1435 1440 1445 

Thr Pro Ala Glu Thr Thr Val Arg Leu Arg Ala Tyr Leu Asn Thr 

1450 1450 1455 

Pro GIv Leu Pro Val Cys Gin Asp His Leu Glu Phe Trp Glu Ser 

1465 1470 1475 

Val Phe Thr Gly Leu Thr His lie Asp Ala His Phe Leu Ser Gin 

1480 1485 1490 

Thr Lys Gin Ser Gly Glu Asn Phe Pro Tyr Leu Val Ala Tyr Gin 
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5? 54 

1495 1S00 1505 

Ala Thr Val Cys Ala Arg Ala Lys Ala Pro Pro Pro Sen Trp Asp 

1510 1515 1520 

Gin Het Trp Lys Cys Leu lie Arg Leu Lys Pro Thr Leu Thr Gly 

1525 1530 1535 

Ala Thr Pro Leu Leu Tyr Arg Leu Gly Sly Val Gin Asn Glu Me 

1540 1545 1550 

Thr Leu Thr His Pro He Thr Lys Tyr He Het Ala Cys Hec Ser 

1555 1560 1565 

Ala Asp Leu Glu Val Val Thr Ser Thr Trp val Leu Val Gly Gly 

15^0 1575 1580 

Val Leu Ala Ala Leu Ala Ala Tyr Cys Leu Ser Thr Gly Ser Val 

1585 1590 1595 

Val lie Val Gly Arg He He Leu Ser fily Lys Pro Ala Val He 
1600 1605 1610 

Pro Asp Arg Glu Val Leu Tyr Arg Glu Phe Asp Glu Met Glu Glu 
1615 1620 1625 

cys Ala Ala His He Pro Tyr Leu Glu Gin Gly Met His Leu Ala 
1^30 1635 1640 

Glu Gin Phe Lys Gin Lys Ala Leu Gly Leu Leu Gin Thr Ala Ser 
1645 16S0 1655 

Lys Gin Ala Glu Thr He Thr Pro Ala Val His Thr Asn Tro Gin 
1660 1665 1670 

Lys Leu Glu Ser Phe Trp Ala Lys His Het Trp Asn Phe Val Ser 
1675 1680 1685 

Gly lie Gin Tyr Leu Ala Gly Leu Ser Thr Leu Pro Gly Asn Pro 
1690 1695 1700 

Ala lie Ala Ser Leu Het Ser Phe Thr Ala Ala Val Thr Ser Pro 
1705 1710 1715 

Leu Thr Thr Gin CIn Thr Leu Leu Phe Asn lie Leu Gly Gly Trp 
1720 1725 1730 
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1 9 5 6 3 



56 



Val A(a Ala Gin Leu Ala Ala Pro Ala Ala Ala Thr Ala Phe Vat 

U35 1740 1 745 

Gly Ala Gly Fie Thr Gly Ala Val He Gly Ser Val Gly leu Gly 

USO 1755 1760 

Lys Val Leu Val Asp He Leu Ala Gly Tyr Gly Ala Gly Val Aia 

1765 1770 1775 

Gly Ala Leu Val Ala Ptie Lys lU Met Ser Gly Glu Aia Pro Thr 

1730 1715 1790 

Ala Glu ASD Leu Val Asn Leu Leu Pro Ala He Leu Ser Pro Gly 

1795 1800 1805 

Ala Leu Val Val Gly Val Val Cys Ala Ala He Leu Arg Arg His 

UIO 1815 1820 

Val Gly Pro Giy Glu Gly Ala Vai Gin Trp Net Asn Arg Leu He 

1825 1830 1835 

Ala Phe Ala Ser Arg Gly Asn His Val Ser Pro Thr His Tyr Val 

1840 1845 1850 

Pro Glu Ser Asp Ala Ser Val Arg Val Thr His He Leu Thr Ser 

1855 1860 1865 

Leu Thr Val Thr Gin Leu Leu Lys Arg Leu His Val Tro He Ser 

1870 1875 1880 

Ser Asp Cys Thr Ala Pro Cys Ala Gly Ser Trp Leu Lys Aso Val 

1885 1890 1895 

Trp Asp Trp lie Cys Glu Val Leu Ser Asp Phe Lys Ser Trp Leu 

1900 1905 1910 

Lys Ala Lys Leu Met Pro Gin Leu Pro Gly He Pro Phe Val Ser 

1915 1920 1925 

Cys fiin Arg Gly Tyr Arg Gly Val Trp Arg Gly Glu Gly lie Hec 

1930 1935 1940 

Ills Ala Arg Cys Pro Cys Gly Ala Asp lie Thr Gly His Vai Lys 

1945 1950 1955 

Asn Gly Ser Hec Arg He Vai Gly Pro Lys Thr Cys Ser Asn Thr 
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57 58 

I960 1965 1970 

Trp Arg Gly Ser m Pro rie Asn Ala His flir Thr Gly Pro Cys 
1975 19M 1985 

Thr Pro Ser Pro Ala Pro Asn Tyr Thr Phe Ala leu Trp Arfl Val 

1995 2000 
Ser Ala Glu Glu Tyr Val 6lu Val Arg Am Leu Gly Asp Phe His 
2005 2010 2015 

Tyr He Thr Gly Val Thr Thr Asp Lys He Lys Cys Pro Cys Gin 
2020 2025 2030 

Val Pro Ser Pro Glu Phe Phe Thr Glu Val Asp Gly Val Arg Leu 
2035 2040 2045 

His Arg Tyr Ala Pro Pro Cys Lys Pro Leu Leu Arg Asp Glu Val 
2050 2055 2360 

Thr Phe ser lie Gly Leu Asn Glu Tyr Leu Val Gly Ser Gin Leu 
2065 2070 2375 

Pro Cys Glu Pro Glu Pro Asp Val Ala Val Leu Thr Ser Het Leu 
2080 20S5 2390 

Thr Asp Pro Ser His He Thr Ala Glu Thr Ala Ala Arg ArgLeu 
2005 2040 2445 

Asn Arg Gly Ser Pro Pro Ser Leu Ala Ser Ser Ser Ala Ser Gin 
2110 2115 2420 

Leu Ser Ala Pro Ser Leu Lys Ala Thr Cys Thr Thr His His Asp 
2125 2130 2435 

Ser Pro Asp Ala Asp Leu lie Thr Ala Asn Leu Leu Trp Arg Gin 
2140 2145 2450 

Glu Met Gly Gly Asn Fie Thr Arg Val Glu Ser Glu Asn Lys He 
2155 2160 2465 

Val He Leu Asp Ser Phe Asp Pro Leu Val Ala Glu Glu Asp Asp 
2170 2175 2480 

Arg Glu He Ser Val Pro Ala Glu He Leu Leu Lys Ser Lys Lys 
2185 2190 2495 
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The Pro Pro AU Het Pro He Trp Ala Arg Pro Asp Tyr Asrt Pro 
Pro Leu Val 6lu Pro Trp Lys Arg Pro Asp Tyr Glu Pro Pro Leu 
Val His Gly Cys Pro LeiJ Pro Pro Pro Lys Pro Thr Pro Val Pro 
Pro Pro Arg Arg Lys Arg Thr Val Val Leu Asp eiu Sor Thr Val 
Ser Ser Ala Leu Ala Glu Leu Ala Thr Lys Thr Phe Gly Ser Ser 
Thr Thr Ser Gly Val Thr Ser Gly Glu Ala Ala Glu Ser Ser Pro 

Ala Pro Ser Cys Asp Sly Glu Leu Asp SerGlu Ala Glu Ser Tyr 
Ser Ser Met Pro Pro Leu Glu Gly Glu Pro Gly Asp Pro Asp Leu 
Ser Asp Gly Ser Trp Ser Thr Val Ser Ser Asp Gly Gly Thr Glu 
Asp Val Vai Cys Cys Ser Hoi Ser Tyr Ser Trp Thr Gly Ala Leu 

lie Thr Pro Cys Ala AlaG lu Glu Thr LysLeu Pro He Asn Ala 
Leu Ser Asn Ser Leu Leu Arg His His Asn Leu Val Tyr Ser Thr 
Thr Ser Arg Ser Ala Gly Gin Arg Gin Lys Lys Val Thr Phe Asp 
Arg Leu Gin Val Leu Asp Asp His Tyr Arg Asp Val Leu Lys Giu 

Ala Lys Ala Lys Ala Ser Thr Val Lys AlaLys Leu Leu Ser Vat 
Glu Gfu Ala Cys Ser Leu Thr Pro Pro His Ser Ala Arg Ser Lys 
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Phe 6ly Typ Gly Ala Lys Asp Val Arg Ser His Ser Ser Lys Ala 
Glu Glu Ala Cys Ser Leu Thr Pro Pro His Ser Ala Arg Ser Lys 
Tyr Asp Val Val Lys Gin Leu Pro He Ala Val Het Giy Thr Ser 
Cys Val Lys Pro Glu Lys Gly Gly Arg Lys Pro Ala Aro Leu He 
Val Tyr Pro Asp Leu Gly Val Arg Val Cys Glu Lys Arg Ala Leu 
Tyr Asp Val Val Lys Gin Leu Pro He Ala Val Het Gly Thr Ser 
Tyr Gly Phe Gin Tyr Ser Pro Ala Gin Arg Val Asp Phe Leu Leu 
Asn Ala Trp Lys Ser Lys Lys Asn Pro Het Gly Phe Ser Tyr Asp 
Thr Arg Cys Phe Asp Ser Thr Val Thr Slu Ala Asp lie Arg Thr 
Glu Glu Asp Leu Tyr Gin Ser Cys Asp Leu Val Pro Glu Ala Arg 
Ala Ala He Arg Ser Leu Thr Glu Arg Leu Tyr He Gly Gly Pro 
Leu Thr Asn Ser Lys Gly Gin Asn Cys Gly Tyr Arg Arg Cys Arg 
Ala Ser Gly Val Leu Thr Thr Ser Cys Gly Asn Thr He Thr Cys 
Tyr Leu Lys Ala Ser Ala Ala Cys Arg Ala Ala lys Leu Arg Asp 
Cys Thr Het Leu Val Cys Gly Asp Asp Leu Val Val He Cys Glu 
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Ser Ala Gly Val Gin Giu Asp Ala Ala Asn Leu Arg Ala Thr 

Glu Ala Het Thr Arg Tyr Ser Ala Pro Pro Gly Asp Pro Pro Gin 

Pro Glu Tyr Asf Leu Glu Leu lie Thr Ser Cys Ser Ser Asa Val 

ser Val Ala His Asp Gly Ala Gly Lys Arg Val Tyr Tyr Leu Thr 

Arg Asp Pro Glu Thr Pro Leu Ala Arg Ala Ala Tro Glu Thr Ala 

Arg His Thr Pro Val Asn Ser Trp Leu Gly Asn lie He Het Phe 

Ala Pro Thr Leu Trp Val Arg Het Val Leu Het Thr His Phe Phe 

Ser He Leu He Ala Gin Glu His Leu Glu Lys Ala Leu Asp Cys 

Glu He Tyr Gly Ala Val His Ser Val Glrj Pro Leu Asp Leu Pro 

Glu He He GlA Arg Leu His Gly Leu Ser Ala Phe Ser Leu His 

Ser Tyr Ser Pro Gly Glu He Asn Arg Val Ala Ala Cys Leu Arg 

Lys Leu Gly Val Pro Pro Leu Arg Ala Trp Arg His Arg Ala Arg 

Ser Val Arg Ala Thr Leu Leu Ser Gin Gly Gly Arg Ala Ala He 

s Gly Lys Tyr Leu Phe Asn Trp Ala Vaf Lys Thr Lys Leu Lys 

Leu Thr Pro Leu Pro Ser Ala Ser Gin Leu Asp Leu Ser Asn Trp 

Phe Thr Gly Gly Tyr Ser Qly Gly Asp I le Tyr His Ser Val Ser 
His Val Arg Pro Arg Trp Phe Phe Trp Cys Leu Leu Leu Leu Ser 

Val Gly Val Gly He Tyr Leu Leu Pro Asn Arg 



[0 04 7] 
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65 66 

E5liO:^$ : 5 0 2 

SlM^oajS : ga^ (YS117-5- aBino acid) 

HetSerThrAsnPpo LysProGlnArgLys ThrLysArgAsnThr AsnAroAraProGln 
AspValLysPhePro GlyGlyGfyGlnlle ValGlyGlyvalTyr LeuLeuProArgArg 
GlyPPOAruValfily ValArgAIaThrArg LysThrSerGfuArg SerGlnProArgGly 
ApgApflGlnProIfe ProLysAlaArgArg ProGluGlyApgSer TrpAlaGInProGly 
TyrProTrpProleu TyrGlyAsnGluGly CysGlyTrpAlaGly TroLeuLeuSerPro 
ArgGlySerArgPro SerTrpGlyProThr AspProArgArgArg SerArgAsnLeuGly 
LysVallleAspThp LeuThrCysGlyPhe AUAspLeuHetGly TyrlleProLeuVal 
GlyAlaPPoLeuGly GlyAlaAJaArgAla LeuAlaHisGlyVal ArgValLeuGluAsp 
GlyValAsnTypAIa ThpGlyAsnLeuPro GlyCysSerPheSer IlePheLeuLeuAla 
LeuLeuSepCysLeu ThrValProAlaSer AlaValGluValArg AsnsepSepGlyVal 
TypHisValThPAsn AspCysProAsnAIa SerValValTyrGlu ThrGluSerLeuIle 
HetHlsLeuPPOGly CysValProCysVal ApgGluGlyAsnAla SerApgCysTppVal 
SepLeuSepppoThp IleAlaAlaLysAsp PpoSerValProVal SerGluMeArgApg 
HisValAspLeulle ValGlyAlaAIaAla PheCysSerAlaHet Typl/alGlyAspLeu 
CysGlySepllePhe leuValGlyGlnfle PheThrPheSerPro ArgArgHisTrpThr 
rhrGJnAspCysAsn CysSerl leTyrPpo GlyHisValThrGfy HisArgHeCAIaTrp 
AspHctMetHecAsn TrpSerProThrGly AlaLeuValHeCAIa GInLeuLeuArgI le 
ProClnAlaValVal AspHeci leAIaGly AlaHisTrpGlyVal LeuAlaGlyLeuAIa 
TyrTyrSerMecVal GlyAsnTrpAlaLys Val ValValValLeu LeuLeuPheAlaGly 
ValAspAlaAspThr GlnValThrGlyGly SerAlaAlaTyrAsp AlaArgGlyLeuAla 
SerleuPheThpPro GlyPpoLysGInAsn IleGlnLeuIleAsn ThrAsnGlySerTrp 
HisIleAsnArgrhr AlaleuAsnCysAsn GluSepLeuAsnThp GlyTrpValAlaGly 
UuPheTypTypHis LysPheAsnSerSer GlyCysProGtuArg HecAlaSerCysGIn 
ProLeuThrAlaPhe AspGInGlyTppGly Prol leThrTyrGlu GlyAsnAlaSerGly 
AspGlnApgProlyr CysTrpHrsTyrAla ProArgPpoCysGly IleValProAlaArg 

GluVal [0 048] 
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(35) 

sr 

m\<r>^^ : 3 6 5 

i£M<0S3S:Sa^ (YS117-3' amino acid) 

ThrValThrGluAla AspKeArgThrGlu GluAspLeuTyrGfn SerCysAspLeuVal 
ProGluAlaArgThr AlalieArgSerLeu ThrGluArgLeuTyr IleGlyGlyProLeu 
ThrAsnSerLysGly GlnAsnCysGlyTyr ArgArgCysArgAla SerGlyVal LeuThr 
ThrSerCysGlyAsn ThrlleTlirCysTyr LeuLysAlaSerAla AUCysArgAlaAla 
LysLeuGlnAspCys ThrHetLeuValCys GlyAs[>AspLeuVal VallleCysGluSer 
AlaGlyValGlnGlu AspAlaAlaSerLeu ArgAlaPheThrGlu AlaHetThrArgTyr 
SerAlaProProGly AspProProGlnPro GluTyrAspLeuGlu LeuIleThrSerCys 
SerSerAsnValSer ValAlaHisAspGly SerGlyLysArgVal TyrTyrLeuThrArg 
AspProGluThrPro LeuAlaArgAlaAIa TrpGluThrAlaArg HisThrProValAsn 
SerTrpLeuGiyAsn IlelleHetPheAla ProThrleuTrpVal ArgHetValLeuHet 
ThrHisPhePheSer IleLeuI leAlaGIn GluHlsLeuGluLys AlaLeuAspCvsGlu 
IleTyrQIyAlaVal HisSerValGlnPro LeuAspLeuProGlu IlelleGlnArgLeu 
HtsGlyLeuSerAia PheSerLeuHlsSer TyrSerProGlyGlu ileAsnArgValAla 
AlaCysLeuArgLys LeuGlyValProPro LeuArgAlaTrpArg HisArgAlaArgSer 
ValArgAlaThrLeu LeuSerGlnGlyGly LysAlaAlalleCys GlyLysTyrLeuPhe 
AsnTrpAlaValLys ThrLysLeuLysLeu ThrProLeuProSer AlaSerGlnLeuAsp 
LeuSerAsnTrpPhe ThrGIyGlyTyrSer GlyGlyAspIleTyr HisSerVafSerHis 
ValArgProArgTrp PhePheTrpCysLeu LeuLeuLeuSerVal GlyValGlylleTyr 
LeuLeuProAsnArg 

[0049] 



it$ffl¥6-3 1 9 5 6 3 

68 



—823— 



i 



(36) 

69 

18 

W31<r>^^ : 5 O 2 

eM<7)aS:5^« (SR037-5- a«ino acid) 

HetSenhrAsnPro LysProGlnArgLys ThrLysArgAsnThP AsnAriiArflProfiln 
AspValLysPhePro GlyGlyGlyGlnlle VafGlyGlyValTyr LeuLeuProArgArg 
GryPPoArflHetGly ValArgAlaThrApg LysThrSerGluArg SerGlnProArgGly 
ArgArgGlnProIlc ProLysAlaArgArg ProGJuGIyArgSer TrpAlaGlnProGly 
TyrProTrpProLeu TyrGlyAsnGluGly CysGIyTroAlaGly TrpLeuLeuSerPro 
ArgGlySerArgPro SerTrpGlyProSer AspProApgArgArg SerArgAsnLeuGly 
LysValUeAspThr LeuThrCysGlyPhe AJaAspLeuMecGly TyrlleProLeuVal 
GlyAIaProleuGly GlyAlaAlaArgAla LeuAlaHisClyVal ApgVaaeuGIuA5(> 
GlyValAsnTypAla ThrGlyAsnLeuPro GlyCysSerPheSer HePheLeuLeuAla 
LeuLeuSerCysLeu ThrValProAlaSer AlaValGlyValArg AsnSsrSerGlyVal 
TyrHisValThrAsn AsoCysProAsnAJa Ser Va I Va I TyrG I u ThrGluSerLeulle 
IleHlsLeuProGly CysVaJProCysVal ArgGluGlyAsnAla SerArgCysTrpVal 
SerLeuSerProThp ValAlaAlalysAsp ProSerValProVal SerGluIleArgApg 
HisValAspLeuIle ValGlyAlaAlaAla PheCysSerAlaHet TypValGlyAspLeu 
CysGlySerflePhe LeuValGlyGlnlle PheThrleuSepPro ArgArgHisTrpThr 
ThrfilfiAspCysAsn CysSerlleTyrPro GlyHisValThrGly HisArgHetAlaTrp 
AspHetMeEHecAsn TrpserPPoThrGly AlaLeuVafVafAla GInLeuLeuArglle 
ProGltiAlaValVal AspHetl leAlaGly AlaHlsTppGlyVal LeuAlaGlyLeuAla 
TypTyrSerHetVal GlyAsnTrpAULys ValValValValLeu LeuLeuPheAlaGly 
ValAspAfaGluThr GlnValSepGlycly SepAfaAIaGlnThr ThpTyrGlyLeuThp 
AlaLeuPheApgfhr GlyppoAsnGlnLys IleGlnLeulleAsn ThrAsnGlySerTrp 
HisIIeAsnArgThr AlaLeuAsnCysAsn GluSerLeuHisThP GlyTrpLeuAiaAla 
leuPheTypThrHis LysPheAsnSepSer GlyCysLeuGluApg HetAIaSepCysGIn 
PPoLeuSerAlaPhe AspGlnGlyTppGly PPOlIeThpTypGly GlyAsnAlaSerAsp 
GlyGlnApgProTyp CysTrpHisTyrAla PPoArgProCvsGly IleValPpoAlaArg 

GfuVal [0 0 50] 
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■ 1 9 

W^<r>^■^ : 3 6 5 

}SM<7ymSi : (SR037-3' amino acid) 

ThrValThrSiuAla AspIleArgThrGIu GluAspLeuTyrGIn SerCysAspLeuVal 
ProGluAlaArgThr AlalleArgSerLeu ThrGluArjjLeuTyr IleGlyfllyProLeu 
ThrAsnSerLysGly GlnAsnCysGJyTyr ArgArgCysArgAla SerGlyValLeuThr 
ThrSerCysGlyAsn Thrl I eThrCysTyr LeuLysAIaSerAla AlaCysArgAlaAla 
LysLeuArgAspCys ThrHetLeuValCys GlyAspAspLeuVal VallleCysGluSer 
AlaGlyValGlnGlu AspAlaAlaSerLeiJ ArgAlaPheThrGlu AlaNetThrArgTyr 
SerAJaProProGly AspProProGlnPro GluIyrAspLeuGIu LeuIleThrSerCys 
SerSerAsnValSer ValAlaHisAspGly AlaGlyLysArgVar TyrTyrLeuThrArg 
AspProGluThrPro LeuAlaArgAlaAla TppGluTHrAlaArg HisThrProValAsn 
SerTrpLeuGlyAsn IleUeHetPheAla ProTHrLeuTrpVaf ArgMetVai LeuHet 
TtirHisPhePheSer IleLeuIIeAlaGIn GluHisLeuGluLys AlaLeuAspCysGlu 
IleTyrGlyAlaVal HisSerlleGlnPro LeuAspLeuProGlu I lelleG/nArgLeu 
HisGlyLeuSerAla PheSerLeuHisSer TyrSerProGlyGlu IleAsnArgValAla 
AlaCysLeuArgLys LeuGlyValProPro LeuArgAlaTrDArg HisArgAlaArflSer 
ValArgAlaThrLeu LeuSerGlnGlyGly LysAlaAlalleCys GlyLysTyrLeuPhe 
AsnTrpAlaValLys ThrLysLeuLysLeu ThrProLeuProSer AfaSerGlnLeuAsp 
LeuSerAsnTrpPhe ThrGlyGlyTyrAsn GlyGlyAspfleTyr HIsSerValSerArg 
ValArgProArgTrp PhePheTrpCysLeu LeuLeuLeuSerVal GlyvalGlylleTyr 
LeuLeiProAsnArg 
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(5' 4aj:>3 3' iSA,) 


5' ag3&*^><!0fil[2 


ms 


5' -GTT(mCTAT6CCCTACGC-3' 


I>C17M-I8ir (S) 


1344 


GCAGGAGTTTGGTGATGTCA 


nt2999-3018 (A) 


1347 


TGTGTGCTCCACCCGCACn 




«3S2 


TATCGCCCCGACGCCGCACT 


nc3d64-3SS3 (A) 


1350 


TCGGACCTTTACTTGGTCAC 


nt3r23-3r42 (S) 


1349 


GGCTGAAATCGACTGTCTGG 


nt4718-4r37 (A) 


I3S4 


GTHTGOGAGAGCeTCTTCA 


n[5O09-502d (S) 


HoOO 


AuUuAL IbALbLbl CuL 1 LI 


nt6151-6l76 (A) 




TCCCCCACGCACTACGTGCC 


nC6125-6148 (S) 


1344 


GTGATGUGCCGCCCATCTC 


lit 7049- 7066 (A) 




ACAGCCAACCTCCTGTGGAG 


nt7023-7042 (S) 


f360 


CTGTGGATGCCTTGGCCTTA 


nt7S14-7833 (A) 


8335 


ACCA6GACGTGCTCAAG6AG 


nt7792-78l1 ($) 


S316 


CATAGGCTCCGT6AAG6CTC 


ntSGl 1-8630 (A) 



(S) :-tyx, (A) : ry^-bv:z 

HC-G9, YS117, sR0 3 7tsKnmmmmm9mmis. 



mm 










E2/NS 1 
NS5 

3 • mmi& 


259 

573 (1 91) 
576 (1 92) 
3 57 (1 1 9} 
1096 (365) 
6 6 


99. 6-1 00 
96. 2-97. 4 
92. 9-95. 0 
84. 3-87. 4 
96. 5-97. 6 
90. 5-92. 1 


9 9. 0-9 9. 5 
93. 2-97. 4 
7 6. 5-8 2. 4 
98. 6-98. 9 


^ If- 


2927 (867)( 94. 5-95. 6 


94. 7-96. 2 



1 cSHCVOJfiJEgj^ 





1 cSHcv(?)aiia¥ 


1 cs<?)Sc/fts^a (%) 




0/670 (0%) 
0/262 (0%) 
0/64 (0%) 
8/81 (9. 9%) 
0/60 (0%) 



[0W®JB5*;a:iftW3 50 H 1 : HC Vffle^E?ll9^^lC^1JfflL«:i9^i««^£^■r 



— 826— 



(39) 



6-319563 



75 



0 2 : H C - G 9 i: dtli T (C^«(D^aSS^J36t»W^ 

nrvi^ 1 4 0HC vttcih<^iasE5f»ia)fflHtt$^-r 



H3 :mmm2\zxfi^it^%wmu.^om^n^-y 



76 



•To lcS«<ftffl$n;^K (b) IC#3 2 1 <h# 1 0 

C-Jl. 2{iHC-J4, 3tlHC-J6. 4tJHC 
-J 8. 5ttHC-G9, 6tSYSll7. 7tiSR0 
3 7 ^^-To 



[01] 



[03] 



fG2 



NS3 



□ □ c 



] □ 
□e: 



J 



a OitHGO) ;b(ntfi3-8^ ; c(nt732-18QB) ; dM300-18S7) ;e {fitXmWi 

r 0it241-3018) ; S Git2?73-3383r :h drtSTZHTST) : i (fft4633-sa}} 

i OitsGtB-em) ; k Ortmrs-nBs) : i OitToo-BS) ; m Oit7732-«e30) 

n 0Tt8299-S4«l} 



a) 



b) 



1 2 3 4 5 6 7 
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(40) 



6-31956 



[B2] 



HCV - 



2m 



IS (1 aS) 



(1 bS} 



oS (2aM} 



(2 



HCV-1 

HCV-J1 
HCV^ 
HCtf-il/83 

HCV-J 
HC-(2 
HCV-JT 
HCV-flK 
HC\f-T 
OJX 
HCV-T 



HW-J6 



HCV-J8 



6 0% 7 0% 8 0% 9 0% 



1 0 0% 



(51) Int. CI.' sa»Ji2^ /fi^fia?*^ Fi ttifia^SiSf 

C 1 2 Q 1/68 A 7823-4B 

GO IN 33/576 Z 8310-2 J 



(41) i»M¥6-3 1 9 56 3 



// GO IN 33/53 D 8310-2 J 

C0 7K 99:00 
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